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Preface. 


Engineers and scientists play a significant role in the modern world. They are responsible for research, 
design and development of diversified products and the manufacturing processes that make these products. 
Statistics is the branch of applied mathematics that deals with variability and its effect on decision making. 


Statistics refers to the field or discipline of study. Statistics is a group of methods that are used to 
collect, analyze, present, and interpret data and to make decisions. Decisions made by using statistical 
methods are called educated guesses. Statistics has two areas: theoretical or mathematical and applied 
statistics. Theoretical or mathematical statistics deals with the development, deviation, and proof of 
statistical theorems, formula, rules, and laws. Applied statistics concerns the applications of those theorems, 
formulas, rules and laws to solve real-world problems. This book deals with applied statistics. 


Applied statistics is divided into two branches: descriptive statistics and inferential statistics. 
Descriptive statistics involves methods for organizing, displaying, and describing data by using tables, 
graphs and summary measures. The collection of all elements of interest is called a population in statistics. 
The selection of a few elements from this population is called a sample. Statistics deals with making decisions, 
inferences, predictions, and forecasts about populations based on results obtained from samples. Statistics 
that deals with decision-making procedures is called inferential statistics. This branch of statistics is also 
known as inductive reasoning or inductive statistics. Inferential statistics consists of methods that employ 
sample results to help make decisions or predictions about a population. 


Probability, which measures the likelihood that an event will occur, is an important part of statistics. It 
is the basis of inferential statistics, where decisions are made under conditions of uncertainty. Probability 
theory is used to evaluate the uncertainty involved in those decisions. Probability statements are about 
occurrence or non-occurrence of a certain event under certain conditions. 

Each chapter in this book consists of a concise but thorough fundamental statement of the theory, 
principles and methods, followed by numerous illustrative worked examples. A large number of sample 
unworked problems for students’ practice, to amplify and extend the theory are also included at the end of 
each chapter. 

The course material presented in the book has evolved from the author’s experience in teaching this 
course to the undergraduate and graduate students of science and engineering at universities in Canada, 
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US, and the industry during the last 35 years. Probability and Statistics is intended as a textbook for 
undergraduate and graduate students in mathematics, science and engineering curriculum. This book also 
serves a sound basic background for solving many practical probability and statistics problems or for 
pursuing more advanced or specialized studies. 

I sincerely hope that the final outcome of this book helps the students in developing an appreciation 
for the topics of probability and statistics. Answers to all the end of chapter problems are given in 
the book. 


Rao V. Dukkipati 
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CHAPTER 


Numerical 
Descriptive Measures 


This chapter introduces the areas of statistics, basic concepts and definitions, population and sample, types 
of variables, graphical representations of qualitative and quantitative data, numerical summary measures, 
measure of central tendency for ungrouped data (mean, median and mode), measures of dispersion tor 
ungrouped data (range, variance and standard deviation), mean, variance and standard deviation for grouped 
data, measures of position including quartiles, percentiles, box-and-whisker plot. skewness and kurtosis. 


1.1. INTRODUCTION | emer: 


Statistics refers to the field or discipline of study. Statistics is a group of methods that are used to collect. 
analyse, present, and interpret data and to make decisions. Decisions made by using statistical methods 
are called educated guesses. Statistics has two areas, theoretical or mathematical statistics and applied 
Statistics as shown in Fig.1.1. Theoretical or mathematical statistics deals with the development, derivation, 
and proof of statistical theorems, formula, rules, and laws. Applied statistics concerns the applications of 
those theorems, formulae, rules and laws to solve real-world problems. This chapter deals with applied 
Statistics. 

Applied statistics is divided into two branches, Descriptive statistics and Inferential statistics. 
Descriptive statistics involves methods for organising, displaying, and describing data by using tables, 
graphs and summary measures. The collection of all elements of interest is called a population in statistics. 
The selection of a few elements from this population is called a sample. Statistics that deals with making- 
decision, inferences, predictions, and forecasts about populations based on results obtained from samples. 
Statistics that deals with decision-making procedures is called inferential statistics. This branch of statistics 
is also known as inductive reasoning or inductive statistics. Inferential statistics consists of methods that 
employ sample results to help make decisions or predictions about a population. 


Probability, which measures the likelihood that an event will occur, is an important part of statistics. It 
is the basis of inferential statistics, where decisions are made under conditions of uncertainty. Probability 
theory is used to evaluate the uncertainty involved in those decisions. Probability statements are about 
occurrence or non-occurrence of a certain event under certain conditions. 
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STATISTICS 


THEORETICAL STATISTICS OR APPLIED 
MATHEMATICAL STATISTICS STATISTICS 


Application of theorems, formulae, 
rules and laws to solve real world 
problems 


Development 
Derivation 


Proof of theorems 
Formulae 
Rules and laws 


DESCRIPTIVE INFERENTIAL 
STATISTICS —: STATISTICS 


Methods for organising,displaying, 
describing data by tables, graphs 
and summary measures 


Deals with decision-making 
procedures, methods that employ 
sample results to help make 
decisions about a population 


Fig. 1.1: Areas of statistics 


1.1.1 Population and Sample 


A population consists of all elements: individuals, items, or objects, whose characteristics are being studied. 
The population that is being studied is also called the target population. The collection of a few elements 
selected from a population is called a sample. Thus, a portion of the population selected for study is referred 
to as a sample. The collection of information from the elements of a population or a sample is called a 
survey. A survey that includes each and every element of the population is called a census. A survey 
conducted on a sample is called a sample survey. A sample that represents the characteristics of the 
population as closely as possible is called a representative sample. 


A sample drawn in such a way that each element of the population has same chance of being selected 
is called a random sample. lf the chance of being selected is the same for each element of the population, it 
is called a simple random sample. An element or member of a sample or population is a specific subject or 
object about which the information is collected. A variable is a characteristic under study that assumes 
different values for different elements. In relation to a variable, the value of a constant is fixed. The value of 


a variable for an element is called an observation or measurement. A data set is a collection of observations 
on one or more variables. 


1.1.2 Types of Variables 


A variable that can be measured numerically is called a quantitative variable. The data collected on a 
quantitative variable are known as quantitative data. Quantitative variable is classified as either discrete 
variable or continuous variable. A variable whose values are continuous is called a continuous variable. 
A discrete variable can assume only certain values with no intermediate values. On the other hand, a variable 
that can assume any numerical value over a certain interval or intervals is called a continuous variable. 
The time taken to complete a trip to a corner store is an example of a continuous random variable because it 
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can assume any value, say, between one and two hours. A variable that cannot assume a numerical value 
but can be classified into two or more non-numeric categories is called a qualitative or categorical variable. 
The data collected on such a variable are known as qualitative data. The types of variables and types of 


data are classified as shown in Figs.1.2 (a) and (b). 
Quantitative 
Continuous: Weight, Height, Line width measurements 


Discrete: Number of students, Number of accidents 


Qualitative 

Nominal: Black, Green, Yellow, ... 
Ordinal: High, Medium, Low 
Ranked: 1 2 3 etc. 


Data collected on different elements at the same point in time or for the same period of time are called 
cross-section data. 


RANDOM VARIABLE 
A variable whose value is 
determined by the outcome of a 
random experiment 


CONTINUOUS RANDOM 
VARIABLE 
Arandom variable that can 
assume any value contained in 
one or more intervals 


DISCRETE RANDOM 
VARIABLE 
Arandom variable that 
assumes countable values 


Fig. 1.2(a): Types of variables 


TYPES OF DATA 


Categorical data 


Quantitative 


Fig. 1.2(b): Types of data 


Continuous 


1.1.3 Organising Data 
Data recorded in the sequence in which they are collected and before they are processed or ranked are 


called raw data. 
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1.1.3.1 Qualitative Data 


A frequency distribution for qualitative data lists all categories and the number of elements that belong to 
each of the categories. A relative frequency distribution lists the relative frequencies for all categories. 


frequency of that category 


Relative fi f a cat = LD) 
spine fobs wea ansaid aed sum of all frequencies \ 
A percentage distribution lists the percentages for all categories. 

Percentage = relative frequency x 100 (1.2) 


1.1.3.2. Graphical Representation of Qualitative Data 


The bar graph and the pie chart are two types of graphs used to represent or display qualitative data. A 
graph made of bars whose heights represent the frequencies of representative categories is called a bar 
graph. A pie chart is more commonly used to display percentages, although it can be used to display 
frequencies or relative frequencies. The pie chart represents the total sample or population. The pie chart is 
divided into different portions that represent the percentages of the population or sample belonging to 
different categories. 


Example E 1.1 


The following data give the results of a sample survey of three categories A, B and C. 
A B C c A B A B Cc c A B ‘S A 
A ce A B c A B C c G: C A Cc \C 


(a) prepare a frequency distribution table 

(b) compute the relative frequencies and percentages for all three categories A, B and C 
(c) what percentage of the elements in this sample belongs to category B? 

(d) what percentage of the elements in this sample belongs to categories A or C? 

(e) draw a bar graph of the frequency distribution. 


SOLUTION: 
(a) and (b) 


(c) 20% of the element in this sample belongs to category B. 
(d) 30% + 50% = 80% of the elements in this sample belong to categories A or C. 
(e) The bar graph of the frequency distribution is shown in Fig. E1.1. 
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16 


ot 
nN 


co 


Frequency 


Category 


Fig. E1.1: Bar graph of the frequency distribution 


Example E1.2 


Draw the pie chart for the percentage distribution in Example E1.1. 


SOLUTION: 
The pie chart is shown in Fig. E1.2. 


50% 


Fig. E1.2: Pie chart 


1.1.3.3. Graphical Representation of Quantitative Data 


A frequency distribution for quantitative data lists all the classes and the number of values that belong to 
each class. Data presented in the form of a frequency distribution are called grouped data. The class 
boundary is the midpoint of the upper limit of one class and the lower limit of the next class, while class 


width = upper boundary — lower boundary. 
The class width is also known as the class size. 
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lower limit + per limit (1.3) 


The class midpoint or mark = 5 


largest value — mallest value 


idth = (1.4) 
Approximate class width aes aaeee 
The relative frequencies and percentages for a quantitzive data set are calculated as follows: 
frequency of thi class 
Relative frequency of a class = HequenQigat Classe (1.5) 


sum of all frequencies - > Hg 


Percentage = relative frequenc’ x 100 


A histogram is a graph in which classes are marked onthe horizontal axis and the frequencies, relative 
frequencies or percentages are marked on the vertical axis. Tie frequencies, relative frequencies, or percentage 
are represented by the heights of the bars. In a histogram, te bars are drawn adjacent to each other. 


The most common shapes of histograms are symmttric, skewed, and uniform or rectangular. A 
symmetric histogram is identical on both sides of its centnl point. A skewed histogram is nonsymmetric. 
For a skewed histogram, the tail on one side is longer thanthe tail on the other side. A skewed-to-the-right 
histogram has a longer tail on the right side while a skew:d-to-the-left histogram has a longer tail on the 
left side. A uniform or rectangular histogram has the samefrequency for each class. 


A polygon is another method that can be used to repreent quantitative data in graphical form. A graph 
formed by joining the midpoints of the tops of successive lars in a histogram with straight lines is called a 
polygon. A polygon with relative frequencies marked onthe vertical axis is called a relative frequency 
polygon. In a similar manner, a polygon with percentages narked on the vertical axis is called a percentage 
polygon. 

A cumulative frequency distribution gives the total nunber of values that fall below the upper boundary 
of each class. 


: : cumulativ: frequency 
Cumulative relative frequency = — 1... soe (1.6) 
total observaticis in the data set 
Cumulative frequency = cumulative relatve frequency x 100 a7) 


When plotted on a graph, the cumulative frequencies give a curve that is called an ogive. Thus, an 
ogive is a curve drawn for the cumulative frequency distibution by joining with straight lines the dots 
marked above the upper boundaries of classes at height eaal to the cumulative frequencies of respective 
classes. 


Another technique that is used to represent quantitatie data in condensed form is the stem-and-leaf- 
display. In a stem-and-leaf display of quantitative data, eaa value is divided into two portions: a stem and 
a leaf. The leaves for each stem are shown separately in a dsplay. 


Example E1.3 
ee ———e—E—_ 


Out of 27 randomly selected families in a small town, the ‘ollowing data give the number of children less 
than 18 years of age: 
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(a) construct a frequency distribution table using single-valued classes 

(b) compute the relative frequencies and percentages for all classes 

(c) find the number of families with one or two children less than 18 years of age 
(d) find the number of families with two or three children less than 18 years of age 


(e) draw a bar graph for the frequency distribution. 


SOLUTION: 


(a) and (b) The frequency distribution, relative frequencies and percentages are shown in Table E1.3. 


Table E1.3 


Number of children Relative Cumulative Percentage 
less than 18 years frequency | relative frequency (probability) 


(c) Number of families with one or two children under 18 years of age = 3 + 10 = 13. 
(d) Number of families with two or three children under 18 years of age = 10 + 2 = 12. 
(e) The bar graph for the frequency distribution is shown in Fig. E1.3. 
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Fig. E1.3: Bar graph of frequency distribution 
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Example E1.4 


The frequency distribution of ages of all 60 part-time graduate students in a Master of Science in Mechanical 
Engineering degree program of a university is given below: 


0 [8 
ec 


(a) prepare a cumulative frequency distribution table 
(b) calculate the cumulative relative frequencies and cumulative percentages for all classes 
(c) find the percentage of students who are 31 years of age or older 
(d) draw a ogive for the cumulative percentage distribution 
(e) using the ogive, determine the percentage of students who are 35 years or younger. 
SOLUTION: 
(a) and (b) The cumulative frequency, cumulative relative frequency and percentages are shown in Table 
E14. 
Table E1.4 
Frequency | Cumulative | Cumulative relative | Cumulative 
(years) frequenc frequenc 
21-25 0.366 
26-30 
31-35 
3640 | 8 | 54 
J as 
(c) Percentage of students who are 31 years of age or older = 100 — 58.3 = 41.7%. 
(d) The frequency, cumulative relative frequency and the ogive diagram for the cumulative percentage 
distribution is shown in Fig. E1.4. 
(e) The percentage of a student who is 35 years or younger is about 70%. 


&, 100 
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3 
©o 80 
o 
= 70 
2 60 
® 
=] 
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2 40 
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2 
20 
=! 
= 
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O 0 
20.5 24.5 30.5 35.5 40.5 45.5 
Age (years) 


Fig. E1.4: Ogive diagram for the cumulative percentage distribution 
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Example E1.5 
eee 


Following are the overtime monies earned by 15 employers of a company during the month of August 2008. 
780, 892, 995, 1168, 883, 1041, 994, 1084, 776, 981, 1129, 1067, 932, 655, 642 


Prepare a stem-and-leaf display by arranging the leaves for each stem in an increasing order. 
SOLUTION: 


The stem-and-leaf display and the stem-and-leaf display data are shown in Figs. El.5(a) and E1.5(b) 
respectively. 


6 SD. 42 G42 «= «55 
7 80 76 7 | 76 80 
8 92 83 8 83 o2 
9 95 94 8i 32 9 |32 81 94 9§ 
10 41 84 467 10 | 41 67 84 
11 68 29 11 29s 68 
Fig. E1.5(a): Stem-and-leaf Fig. E1.5(b): Stem-and-leaf display of data 
display of data arranged in increasing order 


12 NUMERICAL SUMMARY MEASURES - 


There are three basic numerical summary (descriptive) measures used to organize and display large data 
sets as shown in Fig.1.3. They are: 


1. Measures of central tendency 
2. Measures of dispersion or scatter and 
3. Measures of position. 
UNGROUPED DATA 


_ NUMERICAL SUMMARY MEASURES 
Measures of position 
Ungrouped data | Range | Mean Standard | Quartiles | 
——- Re . ~ deviation ranks 
Mean | | Mode _| 


Fig. 1.3: Three basic numerical summary measures 


Measures of central tendency | |... Measures of dispersion 


1.2.1 Measures of Central Tendency for Ungrouped Data 


A data set is generally represented by numerical summary measures called the typical values. A measure of 
central tendency gives the center of a histogram or a frequency distribution curve. There are three measures 


of central tendency: the mean, the median, and the mode. 
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1.2.1.1 Mean for Ungrouped Data 
An average is a value which is typical or representative of a set of data. Since such values tend to lie 
centrally within a set of data arranged according to magnitude, averages are also called measures of central 
tendency. 

The mean, also called the arithmetic mean or average, for ungrouped data is obtained by dividing the 
sum of all values by the number of values in the data set. Thus, 


Lx (1.8) 
N 


Mean for population data pw 


Dx (19) 


nh 


Mean for sample data Be 


where J is the population mean, )X the sum of all values, N the population size, X the sample mean, and n 
the sample size. Sometimes the data may contain a few very small or a few very large values. Such values 
are called outliers or extreme values. 


1.2.1.2 Median 


The median is the value of the middle term in a data set that has been ranked in either increasing or decreasing 


th 
order. Median for ungrouped data = the value of the at) term in a ranked data set. If the given data 


set represents a population, replace n by N. If the number of observations in a data set is odd, then the 
median is given by the value of the middle term in the ranked data. Similarly, if the number of observations 
is even, then the median is the average of the values of the two middle terms. 


1.2.1.3 Mode 


The mode for an ungrouped data is the value that occurs with the highest frequency in a data set. A data 
set with only one value occurring with highest frequency has only one mode and such a data set is called 
unimodal. A data set with two values occurring with the same (highest) frequency has two modes. The 
distribution in this case is called bimodal. If more than two values in a data set occur with the same (highest) 
frequency, then the data set contains more than two modes and it is said to be multi modal. It should be 
noted here that a data set with each value occurring only once has no mode. 


1.2.1.4 Empirical Relation among Mean, Median and Mode 


For unimodal frequency curves which are moderately skewed (asymmetrical), the empirical relationship is 
given by 
mean — mode = 3 (mean — median) 
Figures 1.4(a) and (b) show the relative positions of the mean, median and mode for frequency curves 


which are skewed to the right and left respectively. It should be noted here that for symmetrical curves, the 
mean, median and mode and they all coincide. 
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JT L-\ 


Mode Median Mean Mean Median Mode 
(a) Skewed to the right (b) Skewed to the left 


Fig. 1.4: Empirical relationship 


Example E1.6 


The following data gives the number of automobiles owned by each of the 30 families in a city. 


2 a) ie i | Z 4 3 1 ] 
3 Z 2 1 2 3 ps a) 1 3 
4 1 5 4 1 1 1 B) 2 2 


(a) prepare a frequency distribution table for this data set using single-valued classes. 
(b) calculate the relative frequency and percentage distributions. 

(c) find the percentage of the families in this sample having two or three automobiles. 
(d) draw a bar graph for the relative frequency distribution. 


SOLUTION: 
(a) and (b) 


(c) Number of families in this sample who have two or three automobiles = 26.7 + 26.7 = 53.4% 


(d) The bar graph for the relative frequency distribution is shown in Fig. E1.6. 
0.5 


o 
a 


So 
ran) 


o 
rN) 


Relative frequency 


= 
— 


Oo 


| 2 3 4 
Number of automobiles in a family 
Fig. E1.6: Bar graph of relative frequency distribution 
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Example E1.7 
The following are the ages of 15 employees of a manufacturing company. 


65, 38, 52, 27, 24, 45, 49, 50, 26, 37, 44, 52, 41, 60, 39 


Calculate the mean, median and mode. 
SOLUTION: 


DX _ (654+38+52+27+ 24445 +49 +50+ 264374444524 41+ 604 39) 


Me 15 


aes 43.266 
15 7 43-266 years. 


Rearrange the given data in an increasing order gives 


24, 26, 27, 37, 38, 39, 41, 44, 45, 49, 50, 52, 52, 60, 65 


illo ae 
oe ee 


Position of the middle term = 


Hence, the median is the value of the middle term in the ranked data. 

Median = 44 years 
In the data set, 52 occur twice and each of the remaining values occurs only once. 
Therefore, mode = 52. 


1.2.2 Measures of Dispersion for Ungrouped Data 
The mean, median and mode are not sufficient measures to reveal the shape of the distribution of a data 
set. The measures that show the spread of a data set are called the measures of dispersion. There are three 
measures of dispersion range, variance, and standard deviation as shown in Fig. 1.5. 
1.2.2.1 Range 
The range is the difference between the largest and smallest values in the data. 
The range for ungrouped data = largest value — smallest value. 


The range is not a very satisfactory measure of dispersion. 


1.2.2.2 Variance and Standard Deviation 


The standard deviation is the most used measure of dispersion. The formulae for calculating the variance 
and standard deviation are as follows: 


Variance for ungrouped data 


Population variance 


Se (1.10) 
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Sample variance 


oS pee (1.11) 


n—-| ll 
where, 6” is the population variance, s?is the sample variance, (x — 1) or (x — X ) is the deviation of x value 


from the mean, p is the population mean, ¥ is the sample mean, N is the population size, and 7 is the 
sample size. 


The population standard deviation o = {population variance = Vo" (112) 
The sample standard deviation s = ./sample variance = Vs? C113) 


A numerical value such as the mean, median, mode, range, variance, or standard deviation computed 
for a population data set is called a population parameter, or simply a parameter. A summary measure 
calculated for a sample data set is called a sample statistic or simply a statistic. 


_. MEASURES OF DISPERSION 
Blicciasraniau ame 


Range = Largest value — Smallest value 


Population standard deviation 
o = Population variance = Yo? 


Sample standard deviation 


Fig. 1.5: Measures of dispersion 


The following data gives the number of years of service of 15 employees in a manufacturing company. 
5 Oey Get 1h 4, 13, 10, 9820).8,. 19, Tees 
Calculate the range, variance and standard deviation. 


SOLUTION: 
Range = largest value — smallest value = 25 — 4 = 21 years 


p 
en 178 
os 2132 is 2732 - 2112.266 _ ig 434 
: == SS vrcrO~—OCOS . 
ne oe PG 15-1 14 


Sindee ion «= +52 =./619.734 = 24.804 
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Example E1.9 


The following data refers to the pay (in thousands of dollars) of 12 employees at a manufacturing company. 
75 @ 65 49 21 21 18 18 17 16 16 15 


(a) calculate the mean, median and mode for this data 
(b) find the range, variance and standard deviation. 


SOLUTION: 


ll 
iT 
A 
6) 
ws) 
ies) 
GD 
OD 


(a) Mean= ail 
hh 


th 
rie : 
Median is term in a ranked data set. Therefore, 


~_ 


er al 

isn 5 =O) 
21+18 

Median = = $19,500 


Mode = $21,000, $18,000 and $16,000 
(b) Range = largest pay — smallest pay = $75,000 — $15,000 = $60,000 
yy? 22) 19 pg 400° 
Sa aes = 563.1515 
Standard deviation = s = V's? = (563.1515 = 23.73 = $23,730 
1.2.3. Mean, Variance and Standard Deviation for Grouped Data 


1.2.3.1 Mean for Grouped Data 


Sn 


Mean for population data: [1 = a (1.14) 


= Sinf 


Mean for sample data: X = (1.15) 
n 
where, m is the midpoint and f is the frequency of a class. 
1.2.3.2 Variance and Standard Deviation for Grouped Data 
Population variance 
Cum 
2 
frp yin ie 
o = LS (mp? eee (1.16) 


N N 
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Sample variance 


_ a(t) 
Pee” fa 


n-1 n-1| 


(1.17) 


where O° is the population variance, s* is the sample variance, m is the midpoint of a class, N is the population 


size, n is the sample size, 1 is the population mean, X is the sample mean, and f the class frequency. 


Population standard deviation o =o’ (1.18) 
Sample standard deviation s= Vs? (1.19) 
Example E1.10 


Calculate the mean, variance and standard derivation for the following population data. 


0-3 | 4-7 | 8-11 | 12-15 
2 


SOLUTION: 


| Classes | f | m | 
ITs} 


Q-3 d. 
4-7 4 20S) 


121 
sai ais 9.5 1624.5 
ee aioe 2187 


16-19 i 1225.25 
Total [54 | [617 __| 8865.50_| 


Population mean 


(yim fy 617° 

2 = 

Dim? f --<——_ 8865.50- = _gg65.50-7049.796 
N 54 54 


eee = 33.624 


Standard deviation 6 = Vo° = V33.624 =5.798 


1.2.4 Measures of Position 
Here we examine several descriptive measures based on percentiles. Descriptive measures based on 
percentiles are not sensitive to the influence of a few extreme observations. Descriptive measures based on 
percentiles are generally preferred over those based on the mean and standard deviation. 
A measure of position determines the position of a single value in relation to other values in a weenie 
or a population data set. The measures of position considered here are the quartiles. percentiles, and percentile 


ranks as shown in Fig. 1.6. 
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MEASURES OF POSITION 
QUARTILES _ _PERCENTILES 


They are three summary measures that divide a ranked data set into four equal parts. 
Second quartile (Q,) is the 50th Third quartile (Q,) is the 
percentile or the median of a data set 75th percentile 


Interquartile Range 
Fig. 1.6: Measures of position 


PERCENTILE RANKS __ 


First quartile (Q,) is the 
25th percentile 


IQR = Q,- Q, 


1.2.4.1 Quartiles and Interquartile Range 
The most commonly used percentiles are quartiles. Let n denotes the number of observations and if we 
arrange the data in an increasing order, then 


Poe ee n+l 
— the first quartile is at position we 


— the second quartile is the median, which is at position = 


; ae on n+l 
— the third quartile is at position 3 Ga 


Thus, a data set has three quartiles, and denoted by Q,, Q,, and Q,, Q,, is the number that divides the 
bottom 25% of the data from the top 75%; the second quartile, Q, , the median, is the number that divides 
the bottom 50% of the data from the top 50%; the third quartile, Q,, is the number that divides the bottom 
75% of the data from the top 25%. Figure 1.7 shows the positions of the three-quartiles. Figure 1.8 shows 
the quartiles for uniform, bell-shaped, right-skewed, and left-skewed distributions. The difference between 
the third quartile and the first quartile of data set is called the interquartile range (IQR). Interquartile range 
is a preferred measure of variation when the median used as the measure of the center. The interquartile 
range is a resistant measure. IQR gives the range of the 50% of the observations. 


Interquartile range IQR = Q, - QO, 


Each of these portions contains 25% of the 
observations of a data set arranged in increasing order 


Q, Q, Q; 
Fig. 1.7: Quartiles 
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(a) Uniform (b) Bell-shaped 


Q, Q, Q, 
(c) Right-skewed 2 (d) Left-skewed 


Fig. 1.8: Quartiles for (a) uniform (b) bell-shaped (c) right-skewed and 
(d) left-skewed distributions 


1.2.4.2 Percentiles 


Percentiles are the summary measures that divide ranked data set into 100 equal parts. Each ranked data set 
has 99 percentiles that divide it into 100 equal parts. The k” percentile is denoted by P,. where & is an 
integer in the range | to 99. Figure 1.9 shows the positions of the 99 percentiles. 


Each of these portions contains 1% of the 
observations of a data set arranged in increasing order 


Fig. 1.9: Percentiles 


Hence, the k” percentile, P,,@an be described as a value in a data set such that about k% of the 
measurements are smaller than the value of P, and about (100 — k)% of the measurements are greater than 
the value of P,. The value of the k” percentile is given by 


th 
P= the value of the bed term in a ranked data set 


where k denotes the number of the percentile and n represents the sample size. 


The percentile rank for a particular value x, of a data set is given by 


number of values less than x; 


: a tk (0) 
Percentile rank of x, total number of values in the data set 


The percentile rank of x, gives the percentage of values in the data set that are smaller than xv. 
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Deciles: Deciles are also useful in statistics. The deciles of a data set divide into tenths, or 10 equal parts. 
A typical data set has nine deciles. If we denote them by D,, D,, ... D,, then the first decile, D, is the 
number that divide the bottom 10% of the data from the top 90%; the second decile, D,, is the number that 
divide the bottom 20% of the data from the top 80%; and so on. Hence, the first decile is the lO" percentile; 
the second decile is the 20" percentile, etc. 


1.2.4.3 Skewness and Kurtosis 


Skewness is the degree of asymmetry, or departure from symmetry, of a distribution. For skewed distribution, 
the mean tends to lie on the same side of the mode as the longer tail as shown earlier in Figs. 1.4(a) and (b). 


Skewness is defined as 


mean—mode mes —mode 


Sk <= 1.20 
eee standard deviation Ss co 
or alternatively, 
3(mean—median) 3(X —median) 
SS SS SS (1.21) 


standard deviation S 
The above two measures are called, respectively, Pearson’s first and second coefficient of skewness. 


Other definitions for skewness in terms of quartiles and percentiles are as follows: 


(Q; — Q,)~ (Q, ~ Q)) 


Quartile coefficient of skewness =e (22) 
Q; = Q, 
Q; -20,+Q 
eee (1.22a) 
QO; -Q, 
Py — Pe) - ~ 
10-90 percentile coefficient of skewness = (Bo = Bo) — Fo = Fo) (1.23) 
Fy) — Fo 
fy = 2Fo + Ao 
a ee 1.23a 
Po — Po ae 


If a frequency curve of a distribution has a longer tail to the right of the central maximum than to the 
left, the distribution is said to be skewed to the right or to have positive skewness. If the reverse is true, it 
is said to be skewed to the left or to have negative skewness. 


Kurtosis is the degree of peakedness of a distribution taken relative to a normal distribution. A 
distribution having a relatively high peak such as the curve of Fig. 1.10(a) is called leptokurtic, while the 
curve of Fig. 1.10(b) which is flat-topped is called platykurtic. The normal distribution, Fig. 1.10(c), which 
is not very peaked or very flat-topped, is called mesokurtic. 


Cm ee 


(a) Leptokurtic (b) Platykurtic (c) Mesokurtic 


Fig. 1.10: Kurtosis 


// Numerical Descriptive Measures // 19 
Example E1.11 


The number of hours worked by 24 employees of a company is given below: 


40 43 40 oe 36 as 40 39 
39 52 pit} 50 41 47 40 48 
38 36 25 41 5) 36 16 40 


(a) calculate the three quartiles and the Interquartile range 
(b) find the approximate value of the 81“ percentile 
(c) calculate the percentile rank of 37. 


SOLUTION: 
(a) The ranked data is 
Ine25 527853 56 36 36 38 39 39 39 40 40 40 40 40 41 41 43 44 47 48 50 52 


Q, = median = 40 
36+ 36 
OQ, = > =1315 
41+ 43 
VS =) 
' 2, 


IQR = Interquartile range = Q, - Q, = 42 - 36 =6 
kn _81(24) 
100 ~=100 


Thus, the 81“ percentile can be approximated by the value of the 19" term in the ranked data, 
which is 43. Hence P,, = 45. 
(c) Seven values in the given data are smaller than 37. 


(b) = 19.44 


ay 
Therefore, the percentile rank of 37 = om x100 = 29.17%. 


1.2.4.4 Box-and-Whisker Plot 

A box-and-whisker plot depicts a graphical representation of data using five measures: the median, the 
first quartile, the third quartile and the smallest and the largest values in the data set between the lower and 
the upper inner fences. A box-and-whisker plot shows the center, spread, and skewness of a data set. It is 
constructed simply by drawing a box and two whiskers that use the median, the first quartile, the third 
quartile and the smallest and the largest values in the data set between the lower and the upper inner 


fences. 

The box plot utilizes three fundamental measures of dispersion in a graphic manner for a set of data. 
The simplest measure of dispersion is the range. The box plot as shown in Fig.1.11 also ferstonpenenen the 
interquartile range, Q,— Q,, and the semi-interquartile range, (Q,— Q,)/2. The semi-intenquartile range also 
corresponds to the median of the data set. Outliers are observations that fall well animale errr pattern 
of the data. An outlier may be the result of a measurement or recording error. an observation from a different 
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population, or an unusual extreme observations. We can utilize quartiles and the interquartile range (IQR) 
to identify potential outliers, that is, to spot observations that may be outliers. Referring to Fig.1.11 we 
define the lower and upper limits, the numbers that lie, respectively 1.5 IQRs below the first quartile and 
1.5 IQRs above the third quartile. 
Thus, Lower limit = Q, — 1.5 IQR 
Upper limit = Q, — 1.5 IQR 

Observations that lie outside the lower and upper limits are potential outliers. Typically outliers are 
observations that fall < (Q, — 3 IQR) and > (Q, + 3 IQR). 

Box plots are useful for comparing two or more data sets and to identify the approximate shape of the 
distribution of a data set. Figure 1.12 shows the uniform, bell-shaped, right-skewed and left-skewed 
distributions and their corresponding box plots. The box width and whisker length relate to skewness and 
symmetry. 

First quartile Median Third quartile 


Smallest value Largest value i 
within the two within the two pare! 
inner fences inner fences { 
* 
(Lower limit) (Upper limit) 


ee es ee ee ee ee ee 


Fig. 1.11: Box-and-whisker plot 


Q, Q, oF Q, Q, Q, 
(a) Uniform (b) Bell-shaped 


OO sn, Qa QQ Q 
(c) Right-skewed (d) Left-skewed 


Fig. 1.12: Distribution shapes and box plots for (a) uniform, (b) bell-shaped, 
(c) right-skewed, (d) left-skewed distributions 
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Example E1.12 


Refer to the data in the Example E1.11 and construct a box-and-whisker plot and comment on the skewness 
of the data. 


SOLUTION: 
Median = Q, = 40, Q, = 36, Q, = 42, IQR = Q, - O, = 42-36 =6 
Lower inner fence = Q, — 9 = 36 —-9 = 27 
Upper inner fence = Q, + 9=42+9=5] 
The box-and-whisker plot is shown in Fig. E1.12. 
The longest and smallest values with the two inner fences are 47 and 35 respectively. There are two 


outliers 50 and 52 shown by an asterisk in Fig. El.12. The data in this example is skewed to the right, 
because the lower 50% of the values are spread over a smaller range than the upper 50% of the value. 


Descriptive Statistics 


Variable N Mean Median Tr. Mean St. Dev SE Mean 
Oa 24 38.83 40.00 39.27 781 1.59 
Variable Min. Max. QO, o 

Cl 16.00 52.00 36.00 42 


Tr. Mean ’ St. Dev SE Mean 
39.27 7.81 4.59 


Q; 
42 


20 30 40 50 
Number of hours worked 


Fig. E1.12: Box plot 


PieS  SOMNARY is a a 


In this chapter, we have introduced the types of statistics, namely, the descriptive statistics and the inferential 
statistics. Descriptive statistics consists of methods for organizing and summarizing information. Inferential 
statistics consists of methods for drawing and measuring the reliability of conclusions about a population 
based on information obtained from a sample of the population. 
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PROBLEMS 


P1.1 


| tal 4 


PES 


P1.4 


PES 


The following data give the results of a sample survey. The letters X, Y and Z represent the three 


categories. 
xX Y he > es YZ. Zea 2 Yo Z 


oe & x x ie xX Yo xX. xX .Y Z /oe Ye cee 


N 


(a) prepare a frequency distribution table 

(b) calculate the relative frequencies and percentages for all categories 

(c) what percentage of the elements in the sample belongs to category Y? 

(d) what percentage of the elements in the sample belongs to category X and Z? 

(e) draw bar graph for the frequency distribution. 

Fifteen faculty members at a university were asked whether ethics course should be required of all 


students as a requirement for graduation. The responses of these faculty members are listed below. 
(F, A. and N indicate that a faculty member is in favour, against, or has no opinion, respectively). 


N A A F N N FEF F F A A N GE F A 
(a) prepare a frequency distribution table 

(b) find the relative frequencies and percentages for all categories 

(c) what percentage of the faculty members in this sample are in favour of this issue? 
(d) draw apie chart for the percentage distribution. 


The following data give the weights (in kg) of a random sample of 30 senior students at a university. 
Construct a frequency distribution table. 


81 77 12 75 °80 83 $2 83.85. 77 SOea3 7452 eee 
73, «78 73, 76 8 84 79 80 79 84 76 78 81 7 7 


A sample of randomly selected households in a large city produced the following data on the 
number of vehicles owned. 


=) 1 1 1 1 1 1 Ps 2 1 1 1 3 2 
2 2 4 1 1 0 2 2 1 1 1 0 1 1 
l 2 2 4 i 2 4 2 3 


Draw a bar graph for vehicles owned. 


The following table gives the frequency distribution of ages of all 50 employees of a company. 


18 to 30 


31 to 43 
44 to 56 
57 to 69 
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P1.6 


Jag ahs 


P1.8 


Pi 


(a) find the class boundaries and class width 

(b) do all classes have the same width? 

(c) prepare the relative frequency and percentage distributions 

(d) find the percentage of the employees that are 43 years old or younger. 

The following data give the number of machines manufactured by a company for a sample of 30 
days. 

24 «21 pie 220 35. 27 3 34 23 2 3327 8G 

ee ee 207 2350255523 «35023 C2881 8 CO 


(a) construct a frequency distribution table using the classes 21-23, 24-26, 27-29. 30-32 and 
33-35. 


(b) calculate the relative frequencies and percentages for all classes 

(c) construct a histogram and a polygon for the percentage distribution. 

(d) for what percentage of the days is the number of machines produced in the interval 27 to 29? 
The academic ranks of fifteen faculty members in a department are listed below. F. A and AS indicate 
that a faculty member is a Full Professor, Associate Professor and Assistant Professor. respectively. 
E AS A F A A F AS VE An iB sASoN EAs OE 

(a) prepare a frequency distribution table 

(b) compute the relative frequencies and percentages for all categories 

(c) what percentages of the faculty in this sample is full professor rank? 

(d) draw a pie chart for the percentage distribution. 


The results of a sample survey of three categories 1, 2 and 3 are given below: 
1 1 a 2 3 1 2 I 2 2 3 1 3 
a | 2 | 1 5) 3 3 2 3) 3 2 2 
(a) prepare a frequency distribution table 
(b) calculate the relative frequencies and percentages for all three categories 
(c) find the percentage of the elements in this sample belonging to category 2 
(d) find the percentage of the elements in this sample belonging to category | or 3. 
The following data gives the number of car oil changes made at a service station for a sample of 30 
days. 
23 el 26 24 34 34 30 26 24 27 
2h 27 30 23 28 32 2 23 27 30 
29 35 33 22 27 Pays 24 35 30 28 
(a) construct a frequency distribution table using the classes 
21-23, 24-26, 27-29, 30-32 and 33-35 
(b) calculate the relative frequencies and percentages for all classes 
(c) construct a histogram and a polygon for the percentage distribution 
(d) for what percentage of the days is the number of oil changes made in the interval 30-32? 
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PEO 


PEM 


P1.12 


PLi3 


P1.14 


The following data gives the 1999 autocollision claims paid by an insurance company to 24 
claimants. 


2301 1678 4634 4541 851 2306 «1174 8974 
1132 1828 749 2585 = 971 2891 1275) 
2782 1161 2406 912 3510 820 1560 =: 1208 


(a) construct the cumulative frequency, cumulative relative frequency and cumulative percentage 
distribution table 

(b) draw an ogive for the cumulative percentage distribution 

(c) using the ogive, find the percentage of claimants in this sample who received $2500 or less. 


The following data gives the collision repair cost in dollars for 25 automobiles randomly selected 
form a list of 100 cars. 


2410 739 2500 = 413 556 
2470 =1789 =2109—S 899 987 
2170) 95326 1343. 1158 = 1234 
GISt, 9e4083 5892 2312 43083 
6721 4800 1325 2911 1926 


(a) 
(b) 
(c) 
(d) 


construct a frequency distribution table. Assume the width of each class as $1500 


calculate the relative frequencies and percentages for all classes 


draw a histogram and a polygon for the relative frequency distribution 


what are the class boundaries and the width of the fourth class? 


The following data give the number of unemployed adults in 2008 for twenty states. 


95 
a2 


328 


23 


132 
270 


49 
24 


48 EY, 167 
287 


231 90 67 
347 55 20 16 107 


(a) construct a frequency distribution table. Take the classes as 1-70, 71-140 and so on 


(b) prepare the relative frequency and percentages. 


The following data give the 2008 total annual compensation (in thousands of dollars) for 29 chief 
executive officers of some large corporations. 


Z095 
1135 
1160 
1208 


1678 
2829 
2406 
1029 


4832 


767 
908 
1385 


4433 
2595 
3501 
1011 


O52 2207. 31075 4200 
977 2891 1276 2781 
811 1568 964 Tai 
2815 


(a) construct a frequency distribution table. Take $1 thousand as the lower limit of the first class 
and $1000 thousand as the width of the class 


(b) prepare the relative frequency and percentages. 


The following data give the number of children less than 18 years of age for 30 randomly selected 


families. 
2 1 1 
] 2 2 


1 
1 


/! Numerical Descriptive Measures // 25 
$$ eeesssssssC(‘“‘_ OY SNUUMEOYiCAL Descriptive Measures Ii 25 


(a) prepare a frequency distribution table using single-valued classes 
(b) calculate the relative frequencies and percentages for all classes 
(c) how many families in this sample have 2 or 3 children less than 18 years of age? 
(d) draw a bar graph for the frequency distribution. 
P1.15 Refer to Problem P1.5. 
(a) prepare a cumulative frequency distribution table 
(b) calculate the cumulative relative frequencies and cumulative percentages for all classes 
(c) what percentage of the employees are 44 years of age or older? 
(d) draw an ogive for the cumulative percentage distribution 
(e) using the ogive, find the percentage of employees who are 40 years old or younger. 


P1.16 Refer to Problem P1.6. Prepare the cumulative frequency, cumulative relative frequency and 
cumulative percentage distribution. 


P1.17 Refer to Problem P1.13. Prepare the cumulative frequency, cumulative relative frequency and 
cumulative percentage distribution by using the table in Problem P1.13. 


P1.18 Following are the SAT scores (out of a maximum possible score of 1600) of 12 students who took 
this test. 
786 892 997 1170 ~—s 881 1042 
795 1084 = 773 980 1128 1066 


Prepare a stem-and-leaf display. Arrange the leaves for each stem in increasing order. 


P1.19 The following data give the time (in minutes) taken to commute from residence to work for 20 
workers of accompany. 
10 50 64 34 47 5 12 23 a0, 27 


26 32 17 a 13 19 Pe) 43 21 22 
Construct a stem-and-leaf display for these data. Arrange the leaves for each stem in increasing 
order. 


P1.20 At the end of a particular training program, 400 students in a college are judged by a panel of 
examiners as excellent, very good, good, or fair. The following data give the distribution: 


Excellent 80 
Very good 140 
Good 120 
Fair 60 


Represent the figures by a pie chart. 
P1.21 The following are the test scores of 60 students on engineering ethics test. 

43 46 48 50 =| =| D2 52 53 Be 
a 55 56 56 a 58 60 66 67 67 
68 68 70 a IZ 1& a5 74 75 f fe) 
12) 75 ag 77 78 78 76 19 80 
80 80 80 81 82 . 82 82 83 83 85 
84 84 85 86 86 86 88 91 92 93 
Construct a ranked stem-and-leaf display of the data. 
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P1.22 


P1.23 


P1.24 


P25 


P1.26 


P1.27 


PIs 


Plot the stem-and-leaf diagram for the following cholesterol readings (mg/deciliter) of twenty patients. 
184 231 195 186 241 190 239 254 225 237 


21 224 214 197 204 235 198 215 216 205 


The following are the scores of 30 students on math test: 

75 53 80 96 66 78 2 87 O23 ys 

69 72 81 62 76 86 79 68 50 92 

83 84 Vd 64 7} 87 a2 53 58 ey) 

Construct a ranked stem-and-leaf display of the data. 

The following data give the monthly rent paid by a sample of 30 households selected from a city. 
428 586 735 675 550 989 1021 620 750 660 

540 578 956 1030 1070 930 871 765 880 975 

651 1020 950 840 781 870 900 801 750 821 

Construct a stem-and-leaf display for the above data. 


The following data gives the moneys donated by 32 university faculty members to a charity during 
the calendar year 2008. 


473 Al7 173 109 178 287 117 is) 

281 474 589 379 89 255 414 ay) 

412 608 454 aly 355 611 469 601 

453 287 489 484 616 354 189 388 

(a) prepare a stem-and-leaf display for these data using the last two digits as leaves 
(b) condense the stem-and-leaf display by grouping the stem as 0-1, 2—3 and 4-6. 


The following data gives the number of hours spent exercising in a gymnasium by 10 randomly 
selected members of a university during the past week. 


8 B 6 0 1 6 11 4 0 7 
Compute the mean, median and mode. 


The following data gives the number of vacuum cleaners produced by a manufacturing company 
for a sample of 12 days. 


235 33 28 24 36 ul 30 2 2A 29 31 23 
Calculate the mean, median and mode. 


The following data gives the number of heavy machinery sold by a large manufacturing company 
for the 12 months in the year 2008. 


3 2 4] 118 204 97 
68 86 @2 BY 98 9 


Find the mean, median and mode. 
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P1.30 


P1.31 


P32 
P33 


P1.34 


P1-35 


P1.36 


ial I 


P1.38 


The following data are the test scores (in points) of 9 students in a class out 6 points: 
5.8 =i 5.9 Ey 52 Sof 3.7 5.7 5:6 


Find the mean, median and mode for this data. 


The following data gives the number of ball bearings manufactured by a company per day for a 
sample of 10 days. 


24 32 oat 23 35 a 29 21 23 28 

Calculate the mean, median and mode for these data. 

The following data give the number of automobile accidents in a city during the past 12 days. 
3 7 11 5 6 3 8 7 6 9 B 2 
Calculate the range, variance and standard deviation. 

Compute the range, variance and standard deviation for the data given in Problem P1.26. 


The following data give the number of hours spent doing the homework and attending classes by 
10 randomly selected college students during the past week. 


18 16 2] 4 39 35 42 29 31 24 
Compute the range, variance and standard deviation. 


The following data give the number of new automobiles sold at a dealership during a 15 days 
period. 


i345 9 7 6 10 11 14 12 8 4 8 16 10 8 
Find the range, mean, variance and standard deviation. 


Following are the 2008 earnings (in thousands of dollars) before taxes of all 10 employees of a 
small company: 


30 35 Ph 28 19 47 52 4 39 32 

Calculate the variance and standard deviation of these data. 

The following data give the number of automobiles that stopped at a service station during each 
of the 10 hours observed: 

30 32 35 29 2B 17 19 30 18 B 


Find the range, variance and standard deviation. 


Consider the following two data sets: 
Data set 1: 8 12 2 37 4] 


Data set 2: 15 19 32 4 48 
Calculate the standard deviation of each data set. Comment on the results obtained. 


The following table gives the frequency distribution of the amounts of long distance telephone 
bills for the month of August 2008 for a sample of 60 families in a city. 
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Number of families 


Amount of long distance telephone bill (dollars) 


25 to less than 50 15 
50 to less than 75 iZ 
75 to less than 100 20 
100 to less than 125 8 


125 to less than 150 5 


Calculate the mean, variance and standard deviation. 


P1.39 Using the sample formulae, find the mean, variance and standard deviation for the grouped data 
given in the following table: 


0 to less than 4 
4 to less than & 


8 to less than 12 
12 to less than 16 
16 to less than 20 
20 to less than 24 


P1.40 The following table gives the frequency distribution of entertainment expenditure (in dollars) 


incurred by 50 families during the past week. 
Number of families 


Entertainment expenditure 
5 
20 to less than 30 


40tolessthan50 | 6 
50 to less than 60 iS 


Find the mean, variance and standard deviation. 


P1.41 The following table gives the grouped data on the weights of all 100 babies born at a hospital in 
2005. 


Weight (kg) 


1 to less than 2 


Number of babies 
5 


2 to less than 3 30 
3 to less than 4 40 
4 to less than 5 20 


5 to less than 6 Ss 


Find the mean, variance and standard deviation. 


P1.42 The following table gives the frequency distribution of the number of personal computers sold 


during the past month at 40 computer stores in a big city. Calculate the mean, variance and standard 
deviation. 
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P1.43 


P1.44 


P1.45 


P1.46 


Computers sold | Number of stores 
4to 12 
Isite 21 
22 to 30 
31 to 39 
40 to 48 


The following table gives the number of television sets owned by 80 households. 


Number of television | Number of households 
sets owned 
0 4 7 | 
1 ae 
Z 27 
3 iM 
4 6 | 


Find the mean, variance, and standard deviation. Use the number of television sets owned as the 
values of m in the formulae. 


The following data gives the scores of 12 students in Design for Economy and Reliability class. 


74 1 67 52 98 al (e) 72 84 87 90 78 


(a) calculate the values of the three quartiles and the interquartile range. Where does the value 
of 80 ilies relative to these quartiles? 


(b) calculate the value of the 93" percentile. Comment on the calculated percentile. 

(c) determine the percentile rank of 81. Comment on the percentile rank. 

The following data gives the number of hours worked in the last week by 30 employees of a 
manufacturing company. 

38 40 42 45 oD 47 28 40 43 39 


49 39 21 31 40 33 39 16 36 4] 

(a) calculate the values of the three quartiles and the interquartile range 

(b) calculate the approximate value of the 79" percentile 

(c) calculate the percentile rank of 39. 

The following data gives the number of electronic components produced at ABC company for a 

sample of 30 days. 

24 32 27 23 6) 33 29 23 25 28 

20 20 26 51 33 27 P| 28 29 23 

36 Zi 26 34 Pa) 36 Pee) 28 31 27 

(a) calculate the values of the three quartiles and the interquartile range. Where does the value 
of 31 lie in relation to these quartiles? 


(b) calculate the value of the 65" percentile 
(c) calculate the percentage of days when the number of electronic components produced was 


32 or higher. 
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P1.47 The following data gives the number of new computer systems sold during a 19 days period. 
93 84 75 63 97 81 73 69 47 58 
SF) 78 84 Te 45 69 83 98 84 
(a) calculate the values of the three quartiles and the interquartile range. Where does the value 
of 81 lies in relation to these quartiles? 
(b) calculate the approximate value of the 93° quartile 
(c) find the percentile rank of 82. 
P1.48 The following are the scores of 20 students in a Mathematics class: 
48 82 82 83 70 as 78 74 46 81 
78 74 64 80 a2 80 80 ik) 80 80 
(a) find the quartiles 
(b) find the interquartile range 
(c) find the five number summary. 


P1.49 The random sample of 21 tourists yielded the following data on length of stay (in days) in a summer 


resort. 
4 3 10 4 6 13 12 15 5 18 f 
a 12 Z 2 6 25 23 1 S) ° 


(a) find the quartiles 
(b) find the interquartile range 
(c) find the five number summary. 
P1.50 The following are the scores of 12 students in a statistics class. 
74 81 68 Si yi) 57 76 re: 85 88 90 79 
(a) find the values of the 3 quartiles 
(b) find the interquartile range. 
P1.51 The following are the ages of 9 employees of a small manufacturing company. 
47 28 40 51 34 SH 62 23 33 
(a) find the values of the three quartiles 
(b) find the interquartile range. 
P1.52 Refer to Problem P1.50. Find the value of the 62" percentile. 
P1.53 Refer to Problem P1.50. Find the percentile rank for the score 85. 
P1.54 A sample of 20 full-time students in a college yielded their weekly times spent on attending classes 
and doing their homework in hours. Determine the quartiles for these data. 
43 6 16 21 32 14 38 20 27 31 
a2 30 25 64 34 30 41 35 26 38 


P1.55 Refer to Problem P1.54. 
(a) find the five-number summary 
(b) find the interquartile range 
(c) obtain the lower and upper limits 
(d) identify potential outliers, if any. 
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P56 


PES, 


P1.58 


Pis9 


P1.60 


P1.61 


The following data gives the incomes in thousands of dollars for a sample of 12 employees in a 
company 


22 16 ol 59 21 Sih 28 5/ 41 91 26 45 
Construct a box-and-whisker plot for this data set. 


The following table shows the number of used cars sold each week during 10 weeks period by a 
car dealership in a large city. 


66 96 147 147 154 5 88 154 57 116 
(a) find the 3 quartiles 

(b) determine the interquartile range 

(c) identify the potential outliers, if any 

(d) construct a box plot. 


The following data give the time (in minutes) taken by 14 students to complete a Maths test. 
Poe il on fil 74 98 25 89 88 19 93 96 87 oi) 
Prepare a box-and-whisker plot. Comment on the skewness of these data. 

The following data give the income (in thousands of dollars) for a sample of 12 households. 
17 Mie 32 22 a2 60 29 38 92 42 27 8646 
Construct a box-and-whisker plot for these data. 

The following data give the hours worked by 30 employees of a company during one week. 
43 39 Zt 40 42 40 35 45 38 47 

40 Sih 36 40 40 23 42 ao all 48 

36 41 39 16 48 Dy ad 34 40 34 

Make a box-and-whisker plot. 


A corporation manufactures ball bearings. The following data give the number of bearings produced 
at the company for a sample of 30 days. 


31 33 ae 34 2 28 26 Pall 31 SP) 
24 2g 32 23 oo 33 29 23 28 25 
21 26 20 BI 33 27 23 ay 29 28 


Prepare a box-and-whisker plot. Comment on the skewness of these data. 


REVIEW QUESTIONS 


Describe the need to group data in the form of a frequency table. 

Describe how the relative frequencies and percentages of categories obtained from the frequencies 
of categories. 

Describe the steps to be made to group a data set in the form of a frequency distribution table. 
How are the relative frequencies and percentages of classes obtained from the frequencies of 


classes? 
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5. Describe the concept of cumulative frequency distribution. How are cumulative relative frequencies 
and cumulative percentages calculated? 
Briefly explain how a stem-and-leaf display for a data set is prepared. 


7. Describe very briefly the following: 


(a) bar graph (b) class 

(c) class boundary (d) class frequency 
(e) class width or size (f) grouped data 
(g) histogram (h) ogive 


(i) pie chart 
8. Explain the difference between ungrouped and grouped data. 

Briefly explain the meaning of the following terms: 

(a) median (b) outlier 

(c) mean (d) mode 
10. Explain the relationship between mean, median and mode for symmetric and skewed histograms. 
11. Explain the difference between a population parameter and a sample statistic. 


12. Are the values of the mean and standard deviation that are calculated using grouped data exact or 
approximate values of the mean and standard deviation, respectively? Explain. 


13. Describe how the three quartiles are calculated for a data set. 

14. What is interquartile range? 

15. Describe how the percentiles are calculated for a data set. 

16. Describe the concept of the percentile rank for an observation of a data set. 
17. Explain what summary measures are used to construct a box-and-whisker plot. 


18. Define the following terms: 


(a) coefficient of variation (b) measures of dispersion 

(c) measures of position (d) parameter 

(e) range (f) first quartile 

(g) second quartile (h) statistic 

(i) third quartile (j) upper inner fence 

(k) upper outer fence (1) variance 
|. Variable is a characteristic that varies from one person or thing to another. (True/False) 
2. Qualitative variable is a numerically valued variable. (True/False) 
3. Quantitative variable is a numerically valued variable. (True/False) 
4. Discrete variable is a qualitative variable whose possible values can be listed. (True/False) 
5. Continuous variable is a quantitative variable whose possible values form some 

interval of numbers. (True/False) 

Data is values of a variable. (True/False) 


7. Qualitative data is values of a qualitative variable. (True/False) 
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Continuous data is values of a continuous variable. (True/False) 
Qualitative variable yields non-numerical data. (True/False) 
Classes are categories for non-grouping data. (True/False) 
Frequency is the number of observations that fall in a class. (True/False) 
Frequency distribution is a listing of all classes and their relative frequencies. (True/False) 
Relative-frequency distribution is a listing of all classes and their frequencies. (True/False) 
Lower cut point is the smallest value that could go in a class. (True/False) 
Upper cut point is the smallest value that could go in the next higher class (equivalent to the 
lower cut point of the next higher class). (True/False) 
The middle of a class is found by summing its cut points. (True/False) 
Width is the average between the cut points of a class. (True/False) 
Grouping can help to make a large and complicated set of data more compact and easier to 
understand. (True/False) 
The concepts of cut points and mid points make sense only for numerical data (for which doing 
arithmetic is meaningful). (True/False) 
The frequency of a class is the number of observations in the class, whereas the relative frequency 
of a class is the ratio of the class frequency to the total number of observations. (True/False) 
The percentages of a class equals 100 times the relative frequency of the class. (True/False ) 


The relative frequency of a class is the percentage of the class expressed as a decimal.(True/False) 


Two data sets that have identical frequency distributions have identical relative-frequency 


distributions. (True/False) 
Two data sets that have identical relative-frequency distributions have identical frequency 
distributions. (True/False) 
The four elements of a grouped-data table are the classes. frequencies, relative frequencies, and 
mid points. (True/False) 
The mid point of each class is not the same as the class. (True/False) 
Relative frequencies always lie between 0 and | and hence provide a standard for comparison. 
(True/False) 
A frequency histogram displays the class frequencies on the vertical axis, whereas a relative- 
frequency histogram displays the class relative frequencies on the vertical axis. (True/False) 
Stem-and-leaf diagrams are generally useful with large data sets. (True/False) 


The distribution of a data set is a table, graph. or formula that provides the values of the 


observations and how often they occur. (True/False) 
Sample data are the values of a variable for a sample of the population. (True/False) 
Population data are the values of a variable for an entire population. (True/False) 
Census data is another name for sample data. (True/False) 

(True/False) 


A sample distribution is the distribution of population data. 
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The population distribution is the distribution of population data. (True/False) 
Distribution of a variable is another name for the population distribution. (True/False) 


The measure of center is to indicate where the center or most typical value of data 
set lies. (True/False) 


The three most important measures of center are the mean, the median and the mode. (True/False) 


The mean, median and mode can all be used with quantitative data. (True/False) 
If the number of observations is even, then the median is the observation exactly the middle of the 
ordered list. (True/False) 
If the number of observations is odd, then the median is the mean of the two middle observations 
in the ordered list. (True/False) 
The range of a data set is the difference between the maximum (largest) and minimum (smallest) 
observations. (True/False) 
The more variation that there is in a data set, the larger is its standard deviation. (True/False) 
Almost all the observations in any data set lie within three standard deviations to either side of 
the mean. (True/False) 
The purpose of a measure of variation is to indicate the amount of variation in a 
data set. (True/False) 
The first quartile is the median part of the entire data set that lies at or below the median of the 
entire data set. (True/False) 
The second quartile 1s the median of the entire data set. (True/False) 
The third quartile is the median part of the entire data set that lies at or below the median of the 
entire data set. (True/False) 
The interquartile range is the difference between the first and third quartiles. (True/False) 
The median and interquartile range are resistant measures, whereas the mean and standard 
deviations are not. (True/False) 
Box plots are useful when the data set (number of observations) is large. (True/False) 
Parameter is a descriptive measure for a population. (True/False) 
Statistic is a descriptive measure for a sample. (True/False) 
For an observed value of a variable x, the corresponding value of the standardised variable z is 
called the x-score of the observation. (True/False) 
Numbers that are used to describe data sets are called descriptive measures. (True/False) 


Descriptive measures that indicate where the center or most typical value of a data set lies are 
called measures of dispersion. (True/False) 


Descriptive measures that indicate the amount of variation or spread, in a data set are called 
measures of variation. (True/False) 
Almost all the observations in any data set lie within 3 standard deviations to either side of the 
mean. (True/False) 
An outlier is an observation that falls well outside the overall pattern of the data. (True/False) 
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ANSWERS TO STATE TRUE OR FALSE 


1. True 2. False 3.True 4. False 5. True 6.True 7. True 8. True 9. True 10. False 
11. True 12. False 13. False 14. True 15. True 16. False 17. False 18. True 19. True 20. True 
21. True 22. True 23. True 24. True 25. True 26. False 27. True 28. True 29. False 30. True 
31. True 32. True 33. False 34. False 35. True 36. True 37. True 38. True 39. True 40. False 
41. False 42. True 43. True 44. True 45. True 46. True 47. True 48. False 49. True 50. True 
51. True 52. True 53. True 54. False 55. True 56. False 57. True 58. True 59. True 


Ben 


CHAPTER 


Probability 


In Chapter-1, we have discussed on the descriptive statistics, that is, methods for organising and summarising 
numerical data. Another important aspect is to present the fundamentals of inferential statistics, that is, 
methods of drawing conclusions about a population based on information from a sample of the population. 
Because inferential statistics involves utilising information from part of a population (a sample) to infer 
conclusions about the entire population, we can never be certain that our conclusions (inferences) are 
correct or true, that is, uncertainty is inherent in inferential statistics. Therefore, we need to be familiar with 
uncertainty before we can understand, develop, and apply the methods of inferential statistics. 


The science of uncertainty is called probability theory. Probability theory enables us to evaluate and 
control the likelihood that a statistical inference is correct. More generally, probability theory provides the 
mathematical basis for inferential statistics. 


This chapter introduces the basic concepts and definitions on probability, events (simple and 
compound), Venn diagram, tree diagram, approaches to probability (classical, relative frequency concept of 
probability, subjective probability, marginal probability and conditional probability. Special multiplication 
rule and Baye’s formula are also briefly presented. 


2.1 


EXPERIMENT, OUTCOME AND SAMPLE SPACE 


The tossing of a coin or the rolling of a die constitutes an experiment. In probability and statistics, the 
term experiment is used in a very wide sense and refers to any procedure that yields a collection of outcomes. 
The knowledge of all possible outcomes when a coin is tossed, or a die is rolled, is important. This is 
always the case for determining the probabilities. Measuring the length of a bolt, weighing the contents of 
a box of materials, and measuring the breaking strength of a metal component are all examples of experiments. 


An experiment is a process that, when performed, results in one and only one of many observations. 
These observations are known as the outcomes of the experiment. The collection of all outcomes for an 
experiment is called a sample space. For tossing a coin, we can use the set { Heads, Tails} as the sample 
space. For rolling a six-sided die, we can use the set {1, 2, 3, 4, 5, 6}. These sample spaces are finite. Some 
experiments have sample spaces with an infinite number of outcomes. The elements of a sample space are 
called sample points. The sample space for an experiment can be described by drawing either a Venn diagram 
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or a tree diagram. A Venn diagram is a picture that depicts all the possible outcomes for an experiment. In 
a tree diagram, a branch of the tree represents each outcome. 


2.2 _SIMPLE AND COMPOSITE EVENTS 2 


An event is a collection of one or more of the outcomes of an experiment. An event may be simple event or 
a compound event as shown in Fig. 2.1. A simple event is also called an elementary event, and a composite 
event is also called a compound event. An event that includes one and only one of the final outcomes for 
an experiment is called a simple event and is generally denoted by E. A compound event is a collection of 
more than one outcome of an experiment. The probability of a composite event is the sum of the probabilities 
of the simple events of which it is composed. When all the sample points or simple events of an experiment 
are equally likely and so have equal probabilities, then the probability of a composite event is easily found. 
For instance, if there are k sample points and a composite event A contains r of them. then P(A) =—. 

The probability that a simple event £, will occur is denoted by P (E,), and the probability that a 
compound event A will occur is denoted by P(A). 


SIMPLE EVENT COMPOSITE EVENT 
(ELEMENTARY EVENT) (COMPOUND EVENT) 
An event that includes one and Acollection of more than one 
only one of the final outcomes outcome of an experiment 

of an experiment 


Fig. 2.1: Classification of an event 


An impossible event is one that has no outcomes in it and consequently, cannot occur. On the other 
hand, a sure event is one that has all the outcomes of the sample space in it and will. therefore, definitely 
occur when the experiment is performed. Thus, the sample space constitutes a sure event. 

Probability Definition: Let us assume that the sample space S has N outcomes ¢,, ¢,. e, so that there are NV 
simple events {e,}, {e,}, {e,}- | | 

The probability of a simple event {e} is a number denoted by Pi{e}] and satisfies the following 
conditions: 

1. P[{e}] is always between 0 and 1 that is 0 < P [{e}] <1. 

2. The sum of the probabilities of all the simple events is 1; that is, P [dpb [teshlt «Pigalle 

The probability of an event A, denoted by P(A). is defined as the sum sian probabilities assigned 7 
the simple events that comprise the event A. The impossible event has probability 0 and the sure event has 


probability 1. 
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Find the probability of getting (a) exactly two tails (event A), (b) at least two tails (event B) in tossing 3 
balanced fair coins. 


SOLUTION: 
(a) The event A of exactly two tails comprises of the sample points (1TH, THT, HTT). 


p( Aye ee 
Hence a eeine 6 
(b) The event B of at least two tails comprises of the sample points (777), (1TH), (THT) and (HTT). 
P(B) ale be X 
Hence, ete Ge geen 
Example E2.2 


Find the probability that the sum of the numbers shown in the two faces, when two dice are thrown (a) is 8 
and (b) 10. 
SOLUTION: 


(a) The event A that the sum of the numbers shown on the two faces is eight consists of the sample 
points: 
(OU TD) (CZ), (Gs, SY) (EEE IG, SO, 2) ararel (7, 10) 


There are 7 sample points. Therefore, the required probability is =. 


(b) The event that the sum is 10 consists of the following sample points: 
(4, 6), (5, 5) and (6, 4) 


: meee GS 1 
Hence, the required probability is — = —. 
q p y 36 12 


2.3 AXIOMS OF PROBABILITY a , 


The subject of probability is based on three rules, known as axioms. They are: 
1. Let S be a sample space. Then P(S) = 1. 
2. For any event A,0< P(A) <1. 
3. If A and B are mutually exclusive events, then P(A U B) = P(A) + P(B) 
More generally, ifAgAy, A,,... are mutually exclusive events then P(A,VA, (A Gi) =P (A,)+P Alas 


24 FINITE PROBABILITY SPACES ; 


Consider a sample space S and the class C of all events. We assume here that if S is finite, then all subsets 


of S are events. S then become a probability space by assigning probabilities to the events in C so that 
they satisfy the probability axioms. 
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If S is a finite sample space with n elements and suppose the physical characteristics of the experiment 
suggest that the various outcomes of the experiment be assigned equal probabilities. Then S becomes a 
probability space, called a finite equiprobable space, if each point in S is assigned the probability 1/n and 
if each event A containing r points is assigned the probability r/n. 


number of elemens in A _ nA) 


ence, “P(A)= = 
number of elements inS n(S) 


(2.1) 


number of ways that t 
és P(A) = y: he event A can occur 22) 
number of ways that the sample sapce $ can occur 


Let S = {a,, a,, ..., a,} be a finite sample space. Then, a finite probability space or finite probability 
model is obtained by assigning each point a in S a real number p,, called the probability of a. satisfying the 
following properties: : 

1. Each p, is nonnegative, p, 2 0. 
2. The sum of the p, = 1, that is, p,+p,++°+p,=1. 
The probability P(A) of an event A is defined to be the sum of the probabilities of the points in A. 


2.5 -- INFINITE PROBABILITY SPACES 


There are two cases: S is countably infinite and S is uncountable. A finite or countably infinite probability 
space S is said to be discrete. An uncountable space S$ which consists of a continuum of points Is said to 
be continuous. 


Case 1: Countably Infinite Sample Spaces 

Suppose S is a countably infinite sample space: S = {a,, a,. ...}. Then, a probability space is obtained 
by assigning a, € S areal number p,, called its probability, such that 

I. Each p, is nonnegative, or p, 2 0. 

2. The sum of the p, is equal to | 


orp, +p, +" = ide I. 
i=l 


The probability P(A) of an event A is then equals the sum of the probabilities of its points. 


Case 2: Uncountable Spaces 
The probability of an event A is given by the ratio of m(A) to m(S): 


P(A) = length of A 
is length of S 
area of A 
PrAY= (2.3) 
or oe area of S 
volume of A 
or P(A) =e cle 


volume of $ 
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where the uncountable sample spaces S are those with some finite geometrical measurement such as length, 
area, or volume and in which a point is selected at random. Such a probability space is said to be uniform. 


2.6 PROPERTIES OF PROBABILITY res ; 


The basic properties of probabilities are: 
1. The probability of an event always lies in the range of 0 and 1. That is 
0=PEy=1 Simple event 
O<P(A)<1 Compound event 
2. The sum of the probabilities of all simple events or final outcomes for an experiment, denoted by 
XP(E,), is always equal to 1. 
Hence, LP(E)=PG)+ RE) +" =1 (2.4) 
3. The probability of an event that cannot occur is 0. (An event that cannot occur is called an 
impossible event). 
4. The probability of an event that must occur is 1. (An event that must occur is called a certain 
event). 


2.7 VENN DIAGRAM | 


Graphical display of events are helpful for explaining and understanding probability. Venn diagrams, named 
after English logician John Venn (1834-1923) are considered one of the excellent ways to visually display 
events and relationships among events. The sample space is shown as a rectangle and the various events 
are drawn as circles (or other geometric shapes) inside the rectangle. Venn diagram for one event is shown 
in Fig. 2.2. 

Each event A has a corresponding event defined by the condition that “A does not occur”. That event 
is called the complement of A and is denoted (not A) or (A’) as shown in Fig. 2.2(b). Event (not A) consists 
of all outcomes not in A, as shown in the Venn diagram in Fig. 2.2(a). With any two events, say, A and B, we 
can associate two new events. One new event is defined by the condition that “both event A and event B 
occur” and is denoted (A & B) or (A and B). Event (A and B) consists of all outcomes common to both event 
A and event B, as shown in Fig. 2.2(d). The other new event associated with A and B is defined by the 
condition “either event A or event B or both occur” or equivalently, that, “at least one of events A and B 


occur’. That event is denoted (A or B) and consists of all outcomes in either event A or event B or both, as 
shown in Fig. 2.2(c). 


Relationship among Events 


(not A): The event “A does not occur”. 
(A and B) or (A & B): The event “both A and B occur’. 
(A or B): The event “either A or B or both occur’. 


Note here that event “both A and B occurs” is the same as the event “both B and A occur’, event 


(A and B) or (A & B) is the same as event (B and A) or (B & A). Similarly, event (A or B) is the same as event 
(B or A). 
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A 
(not A) 
(a) EventA (b) Complement of event A(A’) 
A 
(Aor B) 
(c) Union of events A and B (AUB) (d) Intersection of events A and B (AnB) 
Event Aor B Event (A and B) 


Fig. 2.2: Venn diagram 


2.8 PROBABILITY TREE OR TREE DIAGRAM — 


In a (finite) stochastic process, in which a finite sequence of experiments where each experiment has a 
finite number of outcomes with given probabilities, the most convenient way to describe such a process is 
by means of a labeled tree diagram. 


Various events are drawn as circles (or other geometric shape such as a rectangle, a square, or a circle) 
that depicts all the possible outcomes for an experiment. For instance, the shaded regions of the four Venn 
diagrams of Fig. 2.2 represents respectively, event A, the complement of event A, the union of events A and 
B, and the intersection of events A and B. 

If A and B are any two subsets of a sample space S, their union AUB is the subset of S that contains all 
the elements that are either in A, in B, or in both; their intersection ANB is the subset. S that contains all the 
elements that are in both A and B; on the complement A’ of A is the subset of S that contains all the elements 
of § that are not in A. 

Figure 2.3(a) indicates that events A and B are mutually exclusive, that is, the two sets have no elements 
in common (or the two events cannot both occur). This is written ANB = ¢ denotes the empty set, which 
has no elements at all. Figure 2.3(b) shows that A is contained in B, we write this as ACB. 


q Y ‘© 


(a) Events A and B mutually exclusive (b) Event A contains in Event B 


Fig. 2.3: Venn diagram sharing special relationships among events A and B 
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Example E2.3 


hh OE 


A bin contains a certain number of manufactured mechanical components, a few of which are defective. 
Two components are selected at random from this bin and inspected to determine, if they are non-defective 
or defective. How many total outcomes are possible? Draw a tree diagram for this experiment. Show all the 
outcomes in a Venn diagram. 


SOLUTION: 


Let 


G = the selected component is good 


D = the selected component is defective 


The four outcomes for this experiment are: GG, GD, DG and DD. 


The Venn and tree diagrams are shown in Figs. E2.3(a) and (5). 


Second component Final outcome 
G GG 


Ss First component , 


GD 
DG 


DD 
(a) Venn diagram (b) Tree diagram 


Fig. E2.3: Venn and tree diagrams 


Example E2.4 


In Fig. E2.4, H is the event that an employee has health insurance and D is the event that the employee has 
disability insurance. 


(a) 
(b) 
(c) 
(d) 
(e) 


express in words what events are represented by regions 1, 2, 3 and 4 
what events are represented by regions 1 and 2 together 

regions 2 and 4 together 

regions 1, 2 and 3 together 

regions 2, 3 and 4 together. 


Fig. E2.4: Venn diagram 
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SOLUTION: 
(a) : The employee has health insurance and disability insurance 
: The employee has health insurance but no disability insurance 


: The employee has disability insurance but no health insurance 


WN — 


: The employee has neither health insurance nor disability insurance 
(b) The employee has health insurance. 

(c) The employee does not have health insurance. 

(d) The employee has either health or disability insurance, but not both. 
(e) The employee does not have both kinds of insurance. 


Example E2.5 


A small box contains a few red. and a few blue balls. If two balls are randomly drawn and the colours of 
these balls are observed, how many total outcomes are possible? Draw a tree diagram for this experiment. 
Show all the outcomes in a Venn diagram. 


SOLUTION: 
Let R = a red ball is selected 
B =a blue ball is selected 
The experiment has 4 outcomes: RR, RB, BR and BB. 
The Venn diagram and tree diagram are shown in Figs. E2.5(a) and (5). 


Second ball Final outcomes 
R RR 
Ss First ball 
R 
RB 
BR 
| 
I 
i 
' 
' 
4 
' 
i) 
7 BB 
1 
(a) Venn diagram (b) Tree diagram 
Fig: E2.0 


Example E2.6 


Draw a tree diagram for 3 tosses of a balanced (fair) coin. List all outcomes for this experiment in a sample 


space S. 
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SOLUTION: 
Let H = a toss results in head 


T = a toss results in tail. 
Hence, the sample space is written as S = {HHH, HHT. HTH, HTT, THH, THT, TTH, TTT}. 


First toss Second toss Third toss Final outcomes 
HHH 


HHT 
HTH 


HTT 
THH 
THT 


TTH 


ala 


Fig. E2.6: Tree diagram 


Example E2.7 


Refer to Example E2.5. List all the outcomes included in each of the following events. Indicate which are 
simple and which are compound events. 

(a) both balls are different colours 

(b) at least one ball is red 

(c) not more than one ball is blue 

(d) the first ball is blue and the second is red. 


SOLUTION: 


(a) {RB, BR} ; a compound event 

(b) {RR, RB, BR} ; a compound event 

(c) {RR, RB, BR} ; a compound event 
{BR} ; a simple event. 


(d) 


APPROACHES TO PROBABILITY 


There are three conceptual approaches to probability. They are 


1. classical probability, 
2. the relative frequency concept of probability and 
3. the subjective probability concept. 


2.9.1 Classical Probability 


Outcomes that have the same probability of occurrence are called equally likely outcomes. 


7 Two or more outcomes or events that have the same probability of occurrence are said to be equally 
likely outcomes or events. If E is an event, then P(E) stands for the probability that event E occurs. 


l 


PE = 25 
total number of outcomes for the experiment ad 
number of outcomes favourable to A 
P(A) = —— (2.6) 
total number of outcomes for the experiment ” 


2.9.2 Relative Frequency Concept of Probability 


If an experiment is repeated n times and an event A is observed f times, then, according to the relative 
frequency concept of probability 


P(A) = if 
n 
The relative frequencies are not probabilities but approximate probabilities. 
The law of large numbers states that if an experiment is repeated again and again, the probability of an 
event obtained from the relative frequency approaches the actual or theoretical probability. 


2.9.3 Subjective Probability 


Subjective probability is assigned arbitrarily. It is the probability assigned to an event based on subjective 
judgement, experience, information, and belief. 


2.9.4 Marginal Probability 
Marginal probability is the probability of a single event without consideration of any other event. Marginal 
probability is also known as simple probability. 


2.9.5 Conditional Probability 
It is the probability that an event will occur given that another event has already occurred. 

If A and B are two events, then the conditional probability of A and B is written as P(A|B) and read as 
“the probability of A given that B has already occurred”. 

The conditional probability of an event is the probability that the event occurs under the assumption 


that another event has occurred. The probability that event B occurs given that event A has occur is called 
a conditional probability. It is denoted by the symbol P(B/A), which is read as “the probability of B given 


A”. A is called the given event. 


Joint and Marginal Probabilities 
Data obtained by observing values of one variable of a population are known as univariate data. Data 
obtained by observing two variables of a population are called bivariate data, and a frequency distribution 


for bivariate data is called a contingency table or two-way table. 
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A small bin contains 50 rubber balls. Of them, 26 are red and 24 are green. If one ball is randomly selected 
out of this bin, what is the probability that this ball is 


(a) red 


(b) green 


SOLUTION: 


26 
(a) Pvball selected is red) = 507 Oz 
; 24 
(b) P(ball selected is green) = 50 = 0.48 


Example E2.9 


SS 
A multiple-choice question in a test contains five answers. If a student chooses one answer based on 
“pure guess”, what is the probability that this student answer is 


(a) correct 
(b) wrong 
Does these probabilities add up to 1? If yes, why? 
SOLUTION: 
: l 
(a) P(student’s answer is correct) = 5 = (I)? 
; 4 
(b) P(student’s answer is wrong) = 5 = 0.8 


Yes, these probabilities add up to 1 because this experiment has two and only two outcomes, and according 
to the second property of probability, the sum of their probabilities must be equal to 1. 


Example E2.10 


A sample of 1000 families showed that 35 of them own no automobiles, 207 own one automobile each, 377 
own two automobiles each, 264 own 3 automobiles each and 117 own 4 or more automobiles each. Write a 
frequency distribution for this problem. If one family is chosen randomly from these 1000 families, find the 
probability that this family owns 


(a) two automobiles 


(b) four or more automobiles. 
SOLUTION: 


Refer to Table E2.10. 


(a) P(family selected owns 2 automobiles) = 0.377 
(b) P(family selected owns 4 or more automobiles) = 0.117 
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Table E2.10 


Automobiles owned | Frequency | Relative frequency 


2.10 MUTUALLY EXCLUSIVE EVENTS 


Events that cannot occur together are known as mutually exclusive events. Such events do not have any 
common outcomes. If two or more events are mutually exclusive, then at most one of them will occur every 
time we repeat the experiment. Hence, the occurrence of one event excludes the occurrence of the other 
event or events. For example, consider tossing a balanced coin twice. This experiment has 4 outcomes: HH. 
HT, TH and TT. These outcomes are mutually exclusive since one and only one of them will occur when we 
toss this coin twice. 


Two or more events are mutually exclusive events if no two of them have outcomes in common. The 
Venn diagram shown in Fig. 2.4 show the difference between two events that are mutually exclusive and 
two events that are not mutually exclusive. Similarly Fig. 2.5 shows three mutually exclusive events and 
two cases of three events that are not mutually exclusive. Two events are said to be independent. if the 
occurrence of one does not affect the probability of the occurrence of the other. Thus, if A and B are 
independent events, then either P(A | B) = P(A) or P(B| A) = P(B). If the occurrence of one event affects the 
probability of the occurrence of the other event, then the two events are said to be dependent events. 

Hence, two events will be dependent if either P(A | B) # P(A) or P(B | A) # P(B). Two events are either 
mutually exclusive or independent. That is, mutually exclusive, events are always dependent, and dependent 
and independent events are never exclusive. Similarly, dependent events may or may not be mutually 
exclusive. 

Two mutually exclusive events that are taken together include all the outcomes for an experiment are 
called complementary event. Thus, the complement of event A, denoted by A _ is the event that includes all 
the outcomes for an experiment that are not in A. It is clear that 


P(A)+ P(A) =1 (2.7) 


Also P(A) =1- P(A) and P(A) =1- P(A) (2.8) 


The intersection of two events is given by the outcomes that are common to both events. If A and B 
are two events defined in a sample space, then, the intersection of A and B represent the collection of all 
outcomes that are common to both A and B and is denoted by A and B or AB or BA. 


Common outcomes 


(a) (0) 
Fig. 2.4 
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(a) 
say 2.5 


2.11 INDEPENDENT AND DEPENDENT EVENTS 
Two events are said to be independent if the occurrence of one does not affect the probability of the 
occurrence of the other. In other words, A and B are independent events if either 

P(A | B) = P(A) or P(B | A) = P(B) (2.9) 


It can be shown that if one of these two conditions is true, then the second will also be true, and if one 
is not true then the second will also not be true. 


If the occurrence of one event affects the probability of the occurrence of the other event, then the 
two events are said to be dependent events. 


Using the probability notation, the two events will be dependent if either 
P(A | B) # P(A) (2.10) 
or P(B | A) # P(B) Cl 


2.12 COMPLEMENTARY EVENTS 


The complement event A denoted by A and read as “A bar” or “A complement” is the event that includes 
all the outcomes for an experiment that are not in A. 


Events A and A are complements of each other. The Venn diagram in Fig. 2.6 shows the complementary 


events A and A. 


>| 


Fig. 2.6: Venn diagram of two complementary events 


Since two complementary events are taken together, they include all the outcomes for an experiment 
and because the sum of the probabilities of all outcomes is 1, it is clear that 
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P(A)+P(A)=1 (2.12) 
Also PAy=1— PA) (2.13) 
and P(A)=1-P(A) (2.14) 


Example E2.11 


A group of 2000 randomly selected adults were asked if they are in favour or against building a nuclear 
power plant in the city. The following table gives the results of this survey. 


In favour Against 


(a) If one person is selected at random from these 2000 adults, find the probability that this person is 
(i) in favour of the plant 
(ii) against the plant 
(iii) in favour of the plant given the person is a female 
(iv) amale given the person is against the plant 

(b) Are the events “male” and “in favour” mutually exclusive? What about the events “in favour” 
and “against”? Why or why not? 

(c) Are the events “female” and “male” independent? Why or why not? 


SOLUTION: 


i) Panfi ee yar 
(a) (i) (in favour) = 000 =": 


95 ae inst) = 2 e- = 0.475 
(ii) P(against) = 3000 = 2: 


650 
(iii) P(in favour|female) = i100 = 0.59091 


400 
(iv) P(malelagainst) = 950 = 0.42105 


(b) The events “male” and “in favour” are not mutually exclusive because they can occur together. 
The events “in favour” and “against” are mutually exclusive because they cannot occur together. 


1100 
= — =0.55 
(c) P(female) 2000 


650 
P(femalejin favour) = 1150 ~ 0.56522 


Since these two probabilities are not equal, the events “female” and “in favour” are not independent. 
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Example E2.12 


Pt ch AN Oe 
There are a total of 170 practicing physicians in a big city. Of them, 60 are female and 30 are pediatricians. 
Of the 60 females, 10 are pediatricians. Are the events “female” and “pediatrician” independent? Are they 
mutually exclusive? Explain why or why not. 


SOLUTION: 


30 
P(pediatrician) = ia = 0.17647 


10 
P(pediatrician|female) = 60 = 0.16667 


Since these two probabilities are not equal, the events “female” and “pediatrician” are not independent. 
The events are not mutually exclusive because they can occur together. 


Example E2.13 


A company hired 50 new graduates last month. Of these 25 are male and 15 are business majors. Of the 25 
males, 15 are business majors. Are the events “male” and “business major” independent? Are they mutually 
exclusive? Explain why or why not. 


SOLUTION: 


{5 
P(business major) = ZO) 0.30 


is) 
P(business major|male) = 35> 0.60 


Since these two probabilities are not equal, the events “male” and “business major” are not independent. 
The events are not mutually exclusive because they can occur together. 


Example E2.14 


Let A be the event that a number less than 3 is obtained if a die is rolled once. What is the probability of A? 
What is the complementary event of A, and what is its probability? 


SOLUTION: 

Event A will occur if either 1-spot or a 2-spot is obtained on the die. Hence, 
2 0.3334 

6 = ct 
The complementary event of A is that either a 3-spot or 4-spot or a 5-spot, or a 6-spot is obtained on the 
die. Hence, 


PU) 


P(A )=1-0.3334 = 0.6666 


2.13 INTERSECTION OF EVENTS AND MULTIPLICATION RULE | 


In this section, we introduce the intersection of two events and the application of multiplication rule to find 
the probability of the intersection of events. 


2.13.1 Intersection of Events 


Let A and B be two events defined in a sample space. The intersection of A and B represents the collection 
of all outcomes that are common to both A and B and is denoted by A and B. The intersection of events A 
and B is also denoted by either ANB or AB. Figure 2.7 shows the intersection of events A and B. The 
shaded area in Fig. 2.7 gives the intersection of events A and B. 


Intersection of A and B 


Fig. 2.7: Intersection of events A and B 


2.13.2 Multiplication Rule 


The probability that events A and B happen together is called the joint probability of A and B and is written 
as P(A and B). The probability of the intersection of two events is obtained by multiplying the marginal 
probability of one event by the conditional probability of second event. The rule is called the multiplication 
rule. Hence, 


P(A and B) = P(A) P(BIA) (2.15) 
The joint probability of events A and B can also be denoted by P(AMB) or P(AB). 


Hence, for any two events, their joint probability is equal to the probability that one of the events 
occurs times the conditional probability of the other given event. 


Conditional Probability 


If A and B are two events, then, 


P(A and B) 


P(B| A) = Aer (2.16) 
_ P(Aand B) 2.17) 
and P(A| B)= PB) 


given that P(A) #0 and P(B) #0. 
Hence, for any two events A and B, the conditional probability that one event occurs given that the 
other event has occurred equals the joint probability of the two events divided by the probability of the 


given event. 


Multiplication Rule for Independent Events 
The probability of the intersection of two independent events A and B is 


P(A and B) = P(A) P(B) (2.18) 
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The Special Multiplication Rule 
The general multiplication rule for any two events A and B is given by Eq. (2.16) 
P(A and B) = P(A)-P(B | A) 
If A and B are independent events, then 
P(B| A) = P(B) (2.19) 
Hence, for the special case of independent events, we replace the term P(B| A) in the general multiplication 
rule by the term P(B). 
If A and B are independent events, then 
P(A and B) = P(A) - P(B) (2.20) 
and conversely, if 
P(A and B) = P(A) P(B) 
then A and B are independent events. Hence, two events are independent if and only if their joint probability 
equals the product of their marginal probabilities. 


Similarly, if events A, B, C,... are independent, then 
P(A and B and C and ...)= F(A) < PB) P(C) ... (2.21) 
Mutually exclusive events are those that cannot occur simultaneously. Independent events are those 


for which the occurrence of some does not affect the probabilities of the others occurring. Two or more 
(non-impossible) events cannot be both mutually exclusive and independent. 


Total Probability Rule 


Events A,,A,,A,, ..., A, are said to be exhaustive, if one or more of them must occur. Suppose events Ay sag 
A,, .... A, are mutually exclusive and exhaustive; that is exactly one of the events must occur. Then for any 
event B 


k 
P(B) = ) P(A;)-P(B| A;) 


al 


(2.22) 


Joint Probability of Mutually Exclusive Events 


The joint probability of two mutually exclusive events is always 0. If A and B are two mutually exclusive 
events, then 


P(A and B) =0 (2:23) 


Example E2.15 


In a group of 10 adults, 4 have type A personality and 6 have a type B personality. If two persons are 
selected at random from this group of 10, what is the probability that the first of them has a type A personality 
and the second has a type B personality? Draw a tree diagram for this experiment. 


SOLUTION: 


Let C = first person selected has a type A personality 
D = first person selected has a type B personality 
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E = second person selected has a type A personality 
F = second person selected has a type B personality 
Figure E2.15 shows the tree diagram for the experiment. 


. The probability that the first person has a type A personality and the second has a type B personality 
is given by 


AON 6: 
P(C and F) = P(C) P (FIC) = ae = 0.2667. 
Final outcomes 


P(C and E) = (= (2 | = 0.1334 


Second person 


E/C 


First person 


4\{6 

P(C and F) = (*) () = 0.2667 
Ss 
10 


P(D and E) = | 


P(D and F) = (| (=) = 013334 
10/\9 
Fig. E2.15: Tree diagram 


Example E2.16 
Ten per cent of all items sold by a mail order company are returned by customers for a refund. Find the 
probability that of two items sold during a given hour by this company. 
(a) both will be returned for a refund 
(b) neither will be returned for a refund 
Draw a tree diagram for this experiment. 


SOLUTION: 
et A = first item is returned 
B = first item is not returned 
C = second item is returned 
D = second item is not returned 
(a) P(Aand C)= P(A) P(C) = (0.10)(0.10) = 0.01 
(b) P(Band D) = P(B) P(D) = (0.90)(0.90) = 0.81 
The tree diagram for this equipment is shown in Fig. E2.16. 
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Final outcome 


Second item 


P(A and C) = (0.10)(0.10) = 0.01 


First item 


P(A and D) = (0.10)(0.90) = 0.09 


P(B and C) = (0.90)(0.10) = 0.09 


P(B and D) = (0.90)(0.90) = 0.81 


Fig. E2.16: Tree diagram 


Example E2.17 


The probability that a farmer is in debt is 0.80. What is the probability that three randomly selected farmers 
are all in debt? Assume independence of events. 


SOLUTION: 
Let D, = the first farmer is in debt 
D, = the second farmer is in debt 
D, = the third farmer is in debt 
Hence, P(D, and D, and D,) = P(D,)P(D,)P(D,) = (0.80)(0.80)(0.80) = 0.512 


Example E2.18 


According to some private estimates, the probability that a randomly selected student from the population 
of all students enrolled in all institutions of higher education in a particular state is a female is 0.60, and the 
probability that this student is a female and a part-time student is 0.25. What is the probability that this 
student is part-time given that the student is a female? 


SOLUTION: 
Let F = student selected is a female 
PT = student selected is a part-time 
It is given that 
P(F) = 0.60 and P(PT and F) = 0.25 


Hence, PORE ye os eee 


P(F) —-0.60 
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2.14. UNION OF EVENTS AND THE ADDITION RULE 


In this section, we will discuss the union of events and the addition rule to compute the probability of the 
union of events. 


2.14.1 Union of Events 


The union of two events A and B includes all outcomes that are either in A or in B or in both A and B. Let A 
and B be two events defined in a sample space. The union of events A and B is the collection of all outcomes 
that belong either to A or to B or to both A and B and is denoted by A or B. 


The union of events A and B is also denoted by “AUB”. 


2.14.2 Addition Rule 
The method used to calculate the probability of the union of events is called the addition rule. 
The probability of the union of two events A and B is 
P(A or B) = P(A) + P(B) — P(A and B) 
where P(A) and P(B) are the marginal probabilities of A and B respectively and P(A and B) is the joint 
probability of A and B. 
Addition Rule for Mutually Exclusive Events 
The probability of the union of two mutually exclusive events A and B is 
P(A or B) = P(A) + P(B) (2.24) 


Special Addition Rule 


If event A and event B are mutually exclusive, then 


P(A or B )= P(A) + P(B) (225) 
Generalizing, if events A, B, C, ... are mutually exclusive, then 
P(A or Bor Cor ...)=P (A) +P (B)+P(C)+°*: (2.26) 


Therefore, for mutually exclusive events, the probability that one or another of the events occurs equals 
to the sum of the individual probabilities. 


The General Addition Rule 
For events that are not mutually exclusive, the general addition rule is applied. Referring to Fig. 2.8, we see 
that 
P(A or B) = P(A) + P(B) — P(A and B) (2.27) 
For any two events, the probability that one or the other occurs equals the sum of the individual 
probabilities less the probability that both occur. 


Fig. 2.8: Non-mutually exclusive events 
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Example E2.19 


All 500 employees of a company were asked whether they are smokers or non-smokers of cigarettes and 
whether or not they are college graduates. Based on this information, the following two-way classification 
table was prepared: 


Smoker 


Non-smoker 


Suppose one employee is selected at random from this company. Find the following probabilities: 
(a) P(college graduate or smoker) 
(b) P(smoker or not a college graduate) 

(c) P(smoker or non-smoker) 


SOLUTION: 
Let S = smoker 
N = non-smoker 
C = college graduate 
D = not a college graduate 


(a) P(C or S)= P(C) + P(S)— P(C and S)= San + 5997 S99 =: 


155, 295_100 _ 350 _ 4 
500 500 500 500° 
(c) Since S and N are mutually exclusive events 


155 345 
P(S or N) = P(S)+ PIN) = Sot on = 


(b) P(S or D) = P(S) + P(D) — P(S and D) = 


He) 


Example E2.20 


The probability that a family owns a desktop computer is 0.78 that it owns a DVD player is 0.70, and that it 
owns both the computer and a DVD is 0.60. Find the probability that a randomly selected family owns a 
computer or a DVD player. 


SOLUTION: 
bet C = family selected owns a computer 
D = family selected owns a DVD player 
Then, P(C or D)= P(C) + P(D) - P(C and D) = 0.78 + 0.70 — 0.60 = 0.88 


Example E2.21 


The probability that an open-heart surgery is successful is 0.85. What is the probability that in two randomly 
selected open-heart surgeries at least one will be successful? Draw a tree diagram for this experiment. 
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SOLUTION: 


Let A = first open-heart surgery is successful 
B = first open-heart surgery is not successful 
C = second open-heart surgery is successful 
D = second open-heart surgery is not successful 
P(at least one open-heart Surgery is successful) = P(A and C) + P(A and D) + P(B and G) 
= (0.85)(0.85) + (0.85)(0.15) + (0.15)(0.85) = 0.7225 + 0.1275 + 0.1275 = 0.9775 
The tree diagram is shown in Fig. E2.21. 


Second open-heart 
surgery 


P(A and C) = (0.85)(0.85) = 0.7225 


First open-heart 
surgery _ 


P(A and D) = (0.85)(0.15) = 0.1275 


P(B and C) = (0.15)(0.85) = 0.1275 


P(B and D) = (0.15)(0.15) = 0.0225 
Fig. E2.21 


Example E2.22 


(a) Inaclass of 35 students, 11 are seniors, 8 are juniors, 9 are sophomores, and 7 are freshmen. If 
one student is selected at random from this class, what is the probability that this student is 


(i) ajunior 
(ii) afreshman 

(b) If two students are selected at random from this class of 35 students, find the probability that the 
first student selected is a junior and the second is a sophomore. 


SOLUTION: 


8 
(a) (i) P(student selected is a junior) = 35 = 0.22857 


7 
(ii) P(student selected is a freshman) = 35 = (2 


(b) Let J, = first student selected is a junior 
S, = second student selected is a sophomore 
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8. \/ 39 
J)= (=| 2] = 0.05878 
Example E2.23 


tcc lala EEE 
A manufacturing company makes automobile steering wheels. The manufacturing system involves two 
independent processing machines so that each steering wheel passes through these two processes. The 
probability that the first processing machine is not working properly at any time is 0.09, and the probability 
that the second machine is not working properly at any time is 0.065. Find the probability that both machines 
will not be working properly at any given time. 


P(J, and S,) = PU,) P(S, 


SOLUTION: 
P(both machines not working properly) 


= P(first machine is not working properly) P(second machine not working properly) 
= (0.09)(0.065) = 0.00585 


2.15 BAYE’S FORMULA | | : 


Baye’s formula is quite useful in modifying a probability estimate as additional information becomes availiable. 
We have seen that a conditional probability is one in which the probability of the event depends upon 
whether the other event has occurred. 


P(A/B) = a) 
(A/B) = P(B) (2.28) 
or P(B/A) = ea 
i P(A) (2.29) 
From Eqs. (2.28) and (2.29), we have 
P(AB) = P(A/B) P(B) = P(BIA) P(A) (2.30) 
since AB=BA 
P(A/B) P(A) 
and P(A/B) = ————— 
(A/B) P(B) (2.31) 
Since P(A) + P(A ) = 1, it follows that event B occurs jointly with either A or A . 
P(B) = P(AB) + P(A B) 
and from Eq. (2.30) 
P(B) = P(A) P(B/A) + P(A ) P(BIA ) (2.32) 


Substituting Eq. (2.32) into Eq. (2.31), we get 


P(A) P(BIA) 


PIAIB) = =A) (BIA) + P(A) P(BIA) (2.33) 
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If event A has more than two available alternatives, we can write Eq. (2.33) as 
P(A,) P(BIA,) 


P Se ee 
a) >) P(A;) P(BIA,) (2.34) 


Example E2.24 


Find the joint probability of A and B for the following: 


(a) P(B)=0.60 and P(A/B) = 0.70 
(b) P(A)=02 and P(B/A) = 0.15 
SOLUTION: 


(a) P(A and B) = P(B and A) = P(B) P(A/B) = (0.60)(0.70) = 0.42 
(b) P(A and B) = P(A) P(B/A) = (0.2)(0.15) = 0.03 


Example E2.25 


A certain manufacturing company has 456 employees. Based on the information obtained by asking all the 
employees whether they are smokers or non-smokers and whether or not they are married, the following 
two-way classification table of data was obtained. 


Smoker/Non-smoker 


72 109 
226 


Married | Not marie | Total 


Non-smoker 


(a) If one employee is selected at random from this company, find the probabilities of the following: 
(i) P(Married and non-smoker) 
(ii) P(Smoker and not married) 

(b) Draw a tree diagram and compute all the joint probabilities for the given data. 


SOLUTION: 
(a) Let M = Married 
NM = Not married 
S = Smoker 
NS = Non-smoker 


158 \( 12 


= =| —]| —| =0.2653 
(i) P(M and NS) = P(M) P(NS/M) (3) 0 


LOST 72 
(ii) P(S and NM) = P(S) P(NM/S) = ealeq Ne ie) 


(b) The tree diagram is shown in Fig. E2.25. 
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158) ( 37 
P(M and S) = (=) (=| = 0.0811 


158 \ (21 
P(M and NS) = (=) (=) =0)2655 


456} \ 158 
298) ( 72 
Mand Sy eae 
Bena) (22)( 5) 


9; 
P(NM and NS) = (=) (=) = 0.4956 
456) \ 298 


Fig. E2.25: Tree diagram 


Example E2.26 


eee eS 


Refer to the data of Example E2.25, if one employee is selected at random from the company, calculate the 
following probabilities. 


(a) P(married or smoker) 

(b) P(smoker or not married) 

(c) P(smoker or non-smoker) 
SOLUTION: 


Referring to the solution of Example E2.25, we have 


a) P(M or S)= P(M) + P(S)- P(M and S)=| 7-- |+| 3--|—| gag] = 0.5044 
(b) P(S or NM) = P(S) + P(NM) — P(S and N: -(2)+( 2 )-(Z) -o74 
vi saaipiand NM) ase) \ ang)! ate nea 


(c) Since S and NS are mutually exclusive events, 


P(S ot NS) = P( pws) =( 22 |+( 22) = 
SEA) = 2S) ENNS) =| Ae | AGE ea 
Example E2.27 


The probability that a manufacturer is in a debt is 0.7. What is the probability that three randomly selected 
manufacturers are all in debt? Assume independence of events. 


SOLUTION: 


Let M = first manufacturer in debt 
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M, = second manufacturer in debt 
M, = third manufacturer in debt 
Then, P(M , and M, and M,)= P(M,) P(M,) P(M,) = (0.7)(0.7)(0.7) = 0.343 


Example E2.28 


Find P(A or B) for the following: 
(a) P(A)=0.17, P(B) =0.5 and P(A and B) = 0.12 
(b) P(A)=0.8, P(B) =0.7 and P(A and B) = 0.6. 
SOLUTION: 
(a) P(AorB)= P(A) + P(B) — P(A and B) = 0.17 + 0.5 — 0.12 = 0.55 
(b) P(A or B)= P(A) + P(B)— P(A and B) = 0.8 + 0.7-0.6=0.9 


Example E2.29 
The average acceptability of three automobile transmissions from three suppliers is given below: 
Supplier Relative probability of supply Average acceptability 
1 0.56 0.97 
2 0.32 0.89 
5 0.12 0.77 


(a) Find the probability that any transmission received will perform acceptably 
(b) Find the probability that a particular transmission delivered by supplier 2 when it is known to have 
performed acceptably. 


SOLUTION: 
(a) P(A) =(0.56)(0.97) + (0.32)(0.89) + (0.12)(0.77) = 0.5432 + 0.2848 + 0.0924 = 0.9204 


P(T,)P(A/T,) (0.32)(0.89) _ 0.2848 
XP(T,) P(A/T) ~ (0.56)(0.97) + (0.32)(0.89) +(0.12)(0.77) 0.9204 


(b) PT,/A)= = 0.3094 


0.56 T, 0.97 

Se 

0.12 T, Oo 
Fig. E2.29 


2.16 ADDITIONAL EXAMPLES AND SOLUTIONS i 


Example E2.30 


Following is a percentage distribution for the number of years of school completed by adult employees. 


25 years old or over in a manufacturing company. 
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Years completed 


5-7 4 
12 40 
13-15 20 


16ormore | 18 | 


Determine the probability that an employee randomly selected 


Event 


G 


(a) has completed 8 years of school or less 

(b) has completed 12 years of school or less 

(c) has completed 13 years of school or more 

(d) interpret each of your answers in (a)-(c) in terms of percentages. 


SOLUTION: 
(a) P(the employee has completed 8 years of school or less) = P(A) + P(B) + P(C) 
= 0.02 + 0.04 + 0.05 = 0.11 
(b) P(the employee has completed !2 years of school or less) = P(A) + P(B) + P(C) + P(D) + P(E) 
= 0.02 + 0.04 + 0.05 + 0.11 + 0.40 = 0.62 
(c) P(the employee has completed 13 years of school or more) = P(F) + P(G) 
= 0.20 + 0.18 =0.38 
(d) The interpretation of each of the results above in terms of percentage is as follows: 
(i) 11% of the adults aged 25 years old and over have completed 8 years of school or less 
(ii) 62% of the adults aged 25 years old and over have completed 12 years of school or less 
(iii) 38% of the adults aged 25 years old and over have completed 13 years of school or more. 


Example E2.31 


In a particular corporation, 56% of the employees are white, 95% are male, and 53% are white males. For a 
randomly selected employee, let 


W = Event the employee selected is white 
M = Event the employee selected male 
(a) Find P(W), P(M) and P(W and M) 
(b) Determine P(W or M) and express the answer in percentages 
(c) Find the probability that a randomly selected employee is female. 


SOLUTION: 
(a) P(W)=0.56 
P(M) = 0.95 
P(W or M) = 0.53 
(b) P(WorM)=0.56 + 0.95 — 0.53 = 0.98 
98% of the employees are either white or male. 
(c) P(F)=1-0.95 =0.05 
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Example E2.32 


The number of two door cars and four door vehicles in use by the employees of a company by age are as 


shown in the following Table E2.32. 


Table E2.32 


z 
Si A; | __20 
I2andover A, Ci 
eat 67 


For a randomly selected vehicle (2 Door or 4 Door), determine the probability that the vehicle selected 


(a) 
(b) 
(c) 
(d) 
(e) 


SOLUTION: 


is under 6 years old 

is under 6 years old, given that it is a2 Door 

is a2 Door 

is a 2 Door; given that it is under 6 years old 

interpret your answers in (a) to (c) in terms of percentages. 


The probability that the vehicle selected is 


(a) Under 6 years old is 70/140 = 0.5 

(b) Under 6 years old, given that it is a 2 Door car is 25/73 = 0.343 
(c) A2door car is 73/140 = 0.521 

(d) A 2 door car; given that it is under 6 years old is 25/70 = 0.357 
(e) (i) 50% of all vehicles are under 6 years old 


(ii) 34.3% of all 2 door cars are under 6 years old 
(iii) 52.1% of all vehicles are 2 door 
(iv) 35.7% of all vehicles under 6 years old are 2 door cars. 


mple E2.33 


Table E2.33 presents a joint probability distribution for engineers and scientists by highest degree obtained 


in a particular R&D corporation. 
Table E2.33 


Highest degree 


[Masters.D2 | 0.10 
0.005 
0.48 


Ue 
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Determine the probability that the person selected 
(a) is an engineer 
(b) has a doctorate 
(c) is an engineer with a doctorate 
(d) is an engineer given the person has a doctorate 
(e) has a doctorate, given the person is an engineer 


(f) interpret your answers in (a)-(e) in terms of percentages. 


SOLUTION: 
The probability that the person selected 
(a) is an engineer is 0.48. 
(b) has a doctorate is 0.045. 
(c) is an engineer with a doctorate is 0.02. 
(d) is an engineer given the person has a doctorate is 0.02/0.045 = 0.444. 
(e) has a doctorate, given that the person is an engineer is 0.02/0.48 = 0.0417. 
(f) (i) 48% of all engineers and scientists are engineers. 
(ii) 4.5% of all engineers and scientists have doctorates. 
(iii) 2% of all engineers and scientists are engineers with a doctorate. 
(iv) 44.4% of all engineers and scientists with doctorates are engineers. 
(v) 4.17% of all engineers have doctorates. 


Example E2.34 


A frequency distribution for the class level of students in “Introduction to JAVA programming” course is 
as given in Table E2.34. 


Table E2.34 


[Freshman [| 5 | 
| Sophomore | __14__| 
ore 
eae 


Two students are randomly selected without replacement. Find the probability that 

(a) the first student selected is a junior and the second a senior 

(b) both students selected are sophomores 

(c) draw a tree diagram for the solution of all the probabilities 

(d) what is the probability that one of the students selected is a freshman and the other a sophomore? 
SOLUTION: 
Let F = Freshman on selection i, where i = 1, 2 


S, = Sophomore on selection i, where i = 1, 2 
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J, = Junior on selection i, where i = 1, 2 

H, = Senior on selection i, where i = I, 2 
(a) P(J, and H,) = (11/36)/(6/35) = 0.0524 
(b) P(S, and S,) = (14/36)/(13/35) = 0.1444 
(c) The tree diagram is shown in Fig. E2.34. 


(d) The probability that one of the students selected is a freshman and the other student selected is a 
sophomore is: 


P(F, and S,) + P(S, and F,) = 0.0555 + 0.0555 = 0.111 


Event Probability 
Fo F,and F, 5.4/36.5 = 0.0159 
F, SO, F,and SO, 5.14/36.35 = 0.0555 
J F,and J, 5.11/36.35 = 0.0436 


SE, F,and SE, 5.6/36.35 = 0.0238 


Freshman 
Fs SO and F, 14.5/36.35 = 0.0555 
so SO, SOandSO, 14.13/36.35 = 0.1444 
Sophomore 
abe SO and J, 14.11/36.35 = 0.1222 
SE, SO, andSE, 14.6/36.35 = 0.0666 
Start 
F, J, and F, 11.5/36.35 = 0.0436 
junior J, SO, J,and SO, 14.14/36.35 = 0.1222 
dh J, and J, 11.10/36.35 = 0.0873 
Senior SE, J, andSE, 11.6/36.35 = 0.0523 


F, SEandF, — 6.5/36.35 = 0.0238 
SE SO, SEandSO,  6.14/36.35 = 0.0666 


Jo SE and J, 6.11/36.35 = 0.0523 
SE, SEandSE, 6.5/36.35 = 0.0238 


Fig. E2.34: Tree diagram 


Refer to Example E2.33 
(a) determine P(T,), P(D,) and P(T, and D,) 
(b) are T, and D, independent events? Why? 


SOLUTION: 
(a) P(T,)=0.52 
P(D,) = 0.045 


P(T, and D,) = 0.025 
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(b) The special multiplication rule states that two events T, and D, are independent if, 
P(T, and D,) = P(T,): P(D,) 
Using the data in (a) 
P(T,) - P(D,) = (0.52)(0.045) = 0.0234 
Since this product does not equal P(T, and D,) = 0.025, the events T, and D, are not independent. 


Example E2.36 


The two-way classification of data of 2000 randomly selected employees of a manufacturing company from 
a city based on gender and commuting time from residence to work place is given in Table E2.36. 


Table E2.36 


Gender Commuting time from residence to work place 


Less than 30 min | 30 min to | hour 
Men 525 450 225 
700 2000 


(a) If one employee is selected at random from these 2000 employees, find the probability that this 
employee 


(i) commutes from more than 1 hour 
(ii) commute for less than 30 minutes 
(iii) is aman commuting for 30 min to | hour 
(iv) is a woman commuting for more than | hour. 
(b) Determine if the following events are mutually exclusive 
(i) ‘men’ and ‘commuting for more than 1 hour’ 
(it) ‘less than 30 min’ ‘more than | hour’ 
(iii) ‘woman’ and ‘commutes for 30 minutes to 1 hour’. Are these events independent? 


SOLUTION: 


365 
(a) (i) P(commutes for more than 1 hour) = —— = 0.1825 


2000 

(ii) P¢ tes for less than 30 mi ts 46 

ii = 

commutes for less than 30 min.) 000 0.4675 

eee . 450 
(iii) P(commutes for 30 min to 1 hour) = 700 = 0.6428 

; 140 
(iv) P(commutes for more than 1 hour/woman) = 300 =O175 


(b) (i) The term ‘man’ and ‘commutes for more than 1 hour’ are not mutually exclusive. The two 
events can occur together. 


(11) The events ‘less than 30 minutes’ and ‘more than 1 hour’ are mutually exclusive because 
they cannot occur together. 
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800 
(iii) P(woman) = —— =0.4 
\ ) 2000 


P(woman/commutes for 30 min to 1 hour) = on (sms 


Since these two probabilities are not equal, the events ‘woman’ and ‘commute for 30 minutes to 1 hour’ are 
not independent. 


The term probability is used to describe the uncertainty probability, which measures the likelihood that an 
event will occur, is an important part of statistics. Probability theory is used extensively in analysing decision- 
making situations that involve risk or incomplete information. 


PROBLEMS 


P2.1_ In Fig. P2.1, E, T, and N are the events that an automobile brought to a service garage needs an 
engine oil-change, tune-up, or new tires. Express in words the events represented by 
(a) region | 
(b) region 3 
(c) region 7 
(d) regions 1 and 4 together 
(e) regions 2 and 5 together 
(f) regions 3, 5, 6, and 8 together. 


Fig. P2.1: Venn diagram 


P2.2 In Problem 2.1, list the region or combinations of regions representing the events that an automobile 
brought to the garage needs 
(a) tune-up, but neither engine oil-change nor new tires 
(b) an engine oil-change and tune-up 
(c) tune-up or new tires, but not an engine oil-change 
(d) new tires. 


P2.3. Anelectronic security system may be used in any one of the three modes. in any two of the three 
modes, or in all three modes: manual (/), semi-automatic (S) and automatic (A). In tests on 200 


such systems, the following uses were found. 
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P2.4 


P2.5 


P2.6 


(a) 


(b) 


An age distribution of 100 employees at a manufacturing company are given as follows: 


Number 
30 
40 
20 
40 
All remaining | M-only 


define M, S, and A as sets of systems used. Draw a Venn diagram, and show the number of 


systems in each use 


how many systems are in M? In S? In (MUS)? 


Under 40 
40-49 
50-59 
60-69 
70 and over 


For an employee selected at random, let 


(a) 
(b) 
(c) 
(d) 


Refer to Problem P2.4. Find the probability that a randomly selected employee is 


(a) 


A : event the employee is under 40 

B : event the employees is in his or here 40’s 
C : event the employee is in his or her 50’s 

S : event the employee is under 60 

use the tables and the f/n rule to find P(S) 
express event S in terms of events A, B and C 


determine P(A), P(B) and P(C) 


complete P(S) using special addition rule and write the answers from parts (b) and (c). Also 


compare the answer in part (a). 


40 years old or under 


(b) under 60 years old. 


In a particular company, 50% of the adult employees are female, 10% are divorced, and 5% are 


divorced females. For an employee selected at random, let 


F = event the person is female 


D = event the person is divorced. 


/! Probability // 69 


eee OILY HOS 


P2.7 
P2.8 


P2.9 


P2.10 


P2.11 


P2.12 


P2.13 


P2.14 


(a) find P(F), P(D) and P(F and D) 
(b) determine P(F or D) and interpret your answer in terms of percentages 


(c) find the probability that a randomly selected adult is male. 
A fair coin is tossed five times. Find the probability of getting at least one tail. 


Suppose we randomly select two students from a typical co-ed class and observe whether the 
student selected each time is a male or a female. Write all the outcomes for this experiment. Draw 
the Venn and tree diagrams for the experiment. 


In a group of freshman engineering class, some are in favour of mechanical engineering and others 
are against it. Two students are selected at random from this class and asked whether they are in 
favour of or against mechanical engineering. How many distinct outcomes are possible? Draw a 
Venn diagram and a tree diagram for this experiment. List all the outcomes included in each of the 
following events and state whether they are simple or compound events. 

(a) both students are in favour of mechanical engineering 

(b) at most one student is against mechanical engineering 

(c) exactly one student in favour of mechanical engineering. 

Determine the conditional probability P(in favour/male) for the response data in Table P2.10 for 


the 100 employees. They were asked whether they are in favour or against paying higher salaries 
to CEOs of corporations. 


Table P2.10 


Draw the tree diagram to illustrate the computations. 

Refer to Problem P2.10. Find the conditional probability that a randomly selected employee is a 

female given that this employee is in favour of paying high salaries to the CEOs. Draw the tree 

diagram for this computation. 

(a) Determine the probability of getting 3 tails in tossing 3 coins. 

(b) Two dice are thrown. Draw the sample space. Determine the probability of getting 6 in both 
the dice. 

(c) Determine the probability of getting (i) exactly two tails (event A) and (ii) at least two tails 
(event B) in tossing 3 coins. 

(a) Determine the probability that the sum of the numbers shown on the two faces. when two 
dice are thrown 
(i) is7 
(ii) is 10 

(b) A die is thrown. Determine the probability of getting either an even number or a number 
greater than 4 or both. 

Table P2.14 gives the contingency table for age and rank of employees in a particular company. If 

an employee is selected at random, 
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(a) find the unconditional probability that the employee selected is in his/her 50’s 

(b) find the unconditional probability that the employee selected is in his or her 50’s given that 
Grade | employee is selected 

(c) interpret the probabilities found in parts (a) and (b) in terms of percentages. 


Table P2.14 


S| sea 
i 
150 | 120 


Under 30 A, 


30-39 A> 
Age | 40-49 A; | 60 | 
50-59 A, | 150] 60 | 30 | 


P2.15 Refer to Problem P2.14. Compute the conditional probability P(A,/R,) using unconditional 
probabilities. 


P2.16 Data on the marital status of employees in a company are as given in the table below. The table 
provides a joint probability distribution for the marital status of adults by sex. The term “Single” is 
used for “Never Married”. 

Table P2.16 
Marital status 


Single | Married | Widowed | Divorced | P(S,) 


M, M; M, My 
Sex 
0.25 | 0.65 | 0.04 | 0.06 | 1.00 | 


If an employee is selected at random, 


(a) find the probability that the employee selected is divorced, given that the employee selected 
is male, given that the employee selected is divorced. 


(b) use the conditional probability rule for (a). 


P2.17_ In the research and development department of a company, the number of male and female 
employees are shown in the frequency distribution as given below in a tabular form. 


Two employees selected at random from this department. The first employee obtained is not returned 
to the department for possible reselection; that is, the sampling is without replacement. 

(a) determine the probability that the first employee obtained is female and the second is male 
(b) draw the tree diagram. 
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P2.19 


P2.20 
P2.21 


P2.22 


P2.23 


P2.24 


P2.25 


P2.26 


A bin contains 20 machine parts, 4 of which are defective. If 2 machine parts are selected at random 


(without replacement) from this box, what is the probability that both parts are defective? Draw 
the tree diagram. 


Table P2.19 gives the classification of all employees of a particular department in a large corporation 
by gender and college degree. 


Table P2.19 


College graduate 


Not a college graduate 
(N) 


Male (M) 
Female (F) 


If one of these employees is selected at random for membership on the planning committee. 
determine the probability that this employee is a female and a college graduate. Draw the tree 
diagram. 

A balanced coin is tossed five times. Determine the probability of getting at least one head. 


The probability that a patient is allergic to penicillin is 0.10. If the drug is administered to three 
individual patients, determine the probability that 

(a) all three of them are allergic to it 

(b) at least one of them is not allergic to it. 

(c) draw the tree diagram. 


Draw the Venn and tree diagram for selecting three times two persons (a man or a woman) from the 

members of a club. 

The search committee of the school of engineering that plans to hire one new faculty member has 

prepared a final of five candidates, all of whom are equally qualified. Out of these five candidates 

two of them are woman. If the committee decides to select at random one person out of these five 

candidates, find the probability that this person will be a woman or a man? Also find the sum of 

these two probabilities and comment the result. 

In a random sample of 1000 companies surveyed showed that 35 of them have no design engineer, 

210 have one design engineer each, 379 have two design engineers, 270 have three design engineers 

and 106 have four or more design engineers. 

(a) prepare a frequency distribution table and calculate the relative frequencies for all categories 

(b) if one company is selected randomly from these 1000 companies, find the probability that 
this company has to design engineers 

(c) find the probability that the company selected has four or more design engineers. 


Of the 25 new graduates hired by a company, 14 are female and 9 are mechanical engineering majors. 
Of the 14 females, 6 are mechanical engineering majors. 

(a) are these events “female” and “mechanical engineering major” independent? 

(b) are these events “mutually exclusive”? 

An individual is picked at random from a group of 56 athletes. Suppose 26 of the athletes are 
female and 6 of them are swimmers. Also, there are 10 swimmers among the males. 
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P2.28 


2,29 


P2.30 


P2.31 


P2.32 


(a) find the probability that she is a swimmer if that the individual picked is a female 
(b) given that the individual picked is a swimmer, find the probability that the person is a male. 


(a) The probability that the stock market goes up on Monday is 0.7. Given that it goes up on 
Monday, the probability that it goes up on Tuesday is 0.3. Find the probability that the market 
goes up on both days. 

(b) The probability that a student passes the midterm exam is 0.7. Given that a student fails the 
midterm exam, the probability that the student passes the final exam is 0.8. Find the probability 
that John (a student in that class) fails the midterm and passes the final. 

(c) Past records show that the probability that the Chairman of the board attends a meeting is 
0.7, that the President of the company attends a meeting is 0.8, and that they both attend a 
meeting is 0.6. Can we say that the Chairman and the President act independently regarding 
their attendance at the meeting of the board of directors? 


Two students are selected at random from a class without replacement whose frequency distribution 
of males and females is given in Table P2.28. 


Table P2.28 


Gender | Frequency 
Male 7 


Female 23 

Total 40 
(a) find the probability that the first student selected is female and the second is male 
(b) draw a tree diagram. 


Given that A and B are two independent events, find their joint probability for the following data: 
(a) P(A)=0.25 and P(B) = 0.78 
(b) PiA)= 055 and PG) =0:23 


The following data gives a two-way classification of all faculty members of a university based on 
gender and tenure. 


|__| Tenured | Non-tenured | 

Male Ups; 48 

Female 34 19 

(a) if one of these faculty members is selected at random, find the following probabilities: 
(i) P(male and non-tenured) 


(ii) P(tenured and female) 
(b) find P(tenured and non-tenured). 


In a group of 12 employees of a company, 5 are graduates and 7 are non-graduates. If 2 persons 
are selected at random from this group, find the probability that the first of them is a graduate and 
the second is a non-graduate. Draw a tree diagram for the solution. 


The probability that an employee at a university is a female is 0.3. The probability that an employee 
is a female and married is 0.2. Find the conditional probability that a randomly selected female 
employee from this university is a married person. 
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P2.33 For the data given in Problem P2.30, if one of these faculty members is selected at random. find the 
following probabilities. 
(a) P(female or non-tenured) 
(b) P(tenured or male). 


P2.34 The probability that a new product introduced to the market is successful is 0.78. What is the 
probability that in two randomly selected new products introduced, at least one will be successful? 
Draw a tree diagram for the solution. 


P2.35 Consider the following events for one roll of a die 
A = an odd number is observed (1, 3, 5) 
B = an even number is observed (2, 4, 6) 
C= a number less than 5 is observed (1, 2, 3, 4) 
(a) Are events A and B mutually exclusive? 
(b) Are events B and C mutually exclusive? 


P2.36 An experimental submarine is estimated to have a probability of 0.90 for a successful first flight. In 
case of failure there is a 0.03 probability of a catastrophic explosion, in which case the abort system 
cannot be used. The abort system has a reliability of 0.97. Calculate the probability of every possible 
outcome of the initial flight. 


P2.37 With voltage fluctuations present, the probability of an electromechanical system going down is 
75%; the probability of it going down during no voltage fluctuations condition is only 8%. The 
probability of voltage fluctuations occurring is 20%. What is the probability that voltage fluctuations 
are present given that the electro mechanical system is down? 


P2.38 Out of a collection of 100 gaskets the following are defective as indicated: 


Type I defect 10 
Type II defect 

Type III defect 

Type I and II defect 
Type I, II and Il defect 
No defect 


(a) if a gasket picked out the collection has a type II defect, what is the probability that it also 
has a type I defect? 

(b) if the gasket picked has a type I defect, what is the probability that it also has a type II 
defect? 

P2.39 Three persons: A, B and C are working independently to solve a mathematical puzzle. Based on 
their previous performance, the probabilities that they will succeed are 0.1, 0.2 and 0.3 respectively. 
What is the probability that the puzzle will be solved? 

P2.40 A system consists of 25 different components, of which one; component A, has been identified as 
the critical component. The probability that the critical component is non-defective is 0.97. If the 
critical component is non-defective, then the system succeeds with at probability of 0.99. If not. 
then the probability of the system success drops to (0.93. What is the probability of system success? 
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P2.47 


P2.48 


An electronic test set up has 97% probability of correctly identifying a defective component and a 
3% probability of identifying a non-defective component as defective. A batch of 100 components 
of which 3 are known as defective is subjected to this electronic testing. If the test identifies a 
component as defective, what is the probability that it is truly defective? 


A mechanical system assembly plant purchases a certain component from three independent 

vendors: A, B and C, who supply 20%, 30% and 50% respectively of the total number of components 

needed by the assembly plant. On an average, the percentage of defective components supplied 

by vendors A, B and C are 5%, 3% and 2% respectively. 

(a) if acomponent is selected at random, find the probability that it is defective 

(b) if the selected component is defective, what is the probability that it was supplied by vendor 
C? by vendor A? by vendor B? 


A bowl contains five white balls, two red balls, and three green balls. What is the probability of 
getting either a white ball or a red ball in one draw from the bowl? 


One bow! contains five white balls, two red balls, and three green balls. Another bowl contains 
three yellow balls and seven black balls. Determine the probability of getting a red ball from the 
first bowl and a yellow ball from the second bow] in one draw from each bowl. 


The probability that a student is in favour of engineering ethics course is 0.60 and a student is 
against is 0.40. Two students are randomly selected, and it is observed whether they favour or 
oppose engineering ethics course. 


(a) draw a tree diagram for this experiment 


(b) determine the probability that at least one of the two students favours engineering ethics 
course. 


Customers who buy a certain make of an automobile can order an engine in any of three sizes. Of 
all automobiles sold, 50% have the smallest engine, 30% have the medium-size one, and 20% have 
the largest. Of automobiles with the smallest engine, 15% fail the emissions test within two years 
of purchase, while 10% of those with the medium size and 15% of those with the largest engine 
fail. Determine the probability that a randomly chosen automobile will fail on emissions test within 
two years. 


A company manufactures aluminium soft drink cans. It was found that the probability that a can 
has a flaw on its side is 0.03, the probability that a can has a flaw on the top is 0.02, and the 
probability that a can has a flaw on both sides and the top is 0.01. 


(a) what is the probability that a randomly chosen can has a flaw? 

(b) what is the probability that it has no flaw? 

(c) what is the probability that a can will have a flaw on the side, given that it has a flaw on top? 

(d) what is the probability that a can will have a flaw on the top, given that it has a flaw on the 
side? 

(a) Two coins are tossed. Determine the conditional probability of getting two heads (event B) 
given that at least one coin shows a head (event A) 


(b) A box contains 5 red and 4 blue balls. Two balls are drawn one-by-one without replacement. 


Given that the first ball drawn is red, find the probability that both the balls drawn will be 
blue. 
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2.55 


P2.56 


2:57 


Of the gas turbine engine blades manufactured by a certain process, 15% are defective. Five turbine 


Si chosen at random. Assume they function independently. Find the probability that they 
all work. 


Refer Problem P2.49. What is the probability that at least one of the gas turbine engine blade 
manufactured works? 


In a certain city, 45% of the people consider themselves conservatives (C), 30% Consider themselves 
to be Liberals (L) and 25% consider themselves to be Independents (/). During the particular election, 
50% of the Conservatives voted, 40% of the Liberals voted, and 60% of the Independents voted. If 
a randomly selected person voted, determine the probability that the voter is 

(a) Conservative 

(b) Liberal 


(c) Independent. 


In a certain company employees, 5% of the men and 1% of the women are heavier than 100 kg. 
Furthermore, 70% of the employees are women. Suppose a randomly selected employee is heavier 
than 100 kg. Find the probability that the employee is a woman. 


In a certain engineering college hostel, 20% are freshmen of whom 10% own a car, 30% are 
sophomores of whom 20% own a car, 20% are juniors of whom 30% own a car, and 20% are seniors 
of whom 40% own a car. A student is randomly selected from the hostel. 

(a) find the probability that the student owns a car 

(b) if the student owns a car, find the probability that the student is a junior. 


At a certain grocery store, eggs come in cartons that hold a dozen eggs. The store experience 
indicates that 78% of the cartons have no broken eggs. 19% have one broken egg. 2% have two 
broken eggs, and 1% have three broken eggs, and that the percentage of cartons with four or 
more eggs broken eggs is negligible. An egg selected at random from a carton is found to be 
broken. What is the probability that this egg is the only broken one in the carton? 


The probability that a college student being male is 1/3 and that being female is 2/3. The probability 

that a male student completes the probability and statistics course successfully is 8/10 and that 

the female student does it is 4/5. A student is selected at random is found to have completed the 

course. Determine the probability that the student is a 

(a) male 

(b) female. 

In a car dealership, assume 55% of the cars are made in US and 15% of these are compact: 25% of 

the cars are made in Europe and 40% of these are compact: and finally, 20% are made in Japan and 

60% of these are compact. If a car is picked at random from this dealership: 

(a) find the probability that it is a compact 

(b) draw a tree diagram for part (a) 

(c) given that the car is a compact, find the probability that it is a European. 

In a survey of the evaluation of preliminary product designs by the customers, 90% of highly 

successful products received good reviews, 60% moderately successful products received good 

reviews, and 10% of poor products received good reviews. In addition, 40% of products have 

been highly successful, 35% have been moderately successful, and 25% have been poor products. 

(a) find the probability that a product attains a good review ' 

(b) if the new design attains a good review, what is the probability that it will be a highly successful 
product? 
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. REVIEW QUESTIONS - ; 


1. 


(c) if a product does not attain a good review, what is the probability that it will be a highly 
successful product? 


A cat is in a room and each of the four walls of the room has a door through which the cat could 
try to escape. However, there is a trap at each of the doors D,, D,, D, and D, and they work with 
probabilities 0.3, 0.2, 0.4 and 0.5 respectively. If the cat picks a door at random: 


(a) find the probability that the cat will escape 
(b) draw the tree diagram for the solution in part (a) 
(c) given that the cat escapes, find the probability that the cat chose door D, to escape. 


According to a particular survey, 8% of the population has a lung disease. Of those having lung 
disease, 90% are smokers; of those not having lung disease, 25% are smokers. Find the probability 
that a randomly selected smoker has lung disease. 


Three different machines m,, m, and m, were used for producing a large batch of similar manufactured 
items. Suppose that 30% of the items were produced by machine m,, 30% by machine m,, and 40% 
by machine m,. Also, 1% of the items produced by machine m, are defective, 2% of the items 
produced by machine m, are defective, and 3% of the items produced by machine m, are defective. 
If one item is selected at random from the entire batch, and it is found to be defective. Determine 
the probability that this item was produced machine m,. 


Define the following terms: 

(a) experiment (b) outcome 

(c) sample space (d) simple event 
(e) compound event. 

Describe the properties of probability. 


Describe an impossible event and a sure event. What is the probability of the occurrence of each 
of these two events? 


Describe the three approaches to probability. 

Explain the difference between the marginal and conditional probability of events. 
What is meant by two mutually exclusive events? 

Explain the meaning of independent and dependent events. 


What is the complement of an event? What is the sum of the probabilities of two complementary 
events? 


Explain the meaning of the following terms: 

(a) intersection of two events 

(b) the joint probability of two or more events 

(c) multiplication rule of probability 

(d) joint probability of two mutually exclusive events. 

Explain the meaning of the following: 

(a) union of two events (b) addition rule of probability 


(c) classical probability rule (d) equally likely outcomes 
(e) sample point. 


!! Probability // 77 
eee robdability tl 77 


STATE TRUE OR FALSE 
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The probability of an event is always between 0 and 1, inclusive. (True/False) 
The probability of an event that cannot occur is 1. (True/False) 
The probability of an event that must occur is 1. (True/False) 
An event that cannot occur is called a certain event. (True/False) 
An event that must occur is called a certain event. (True/False) 
An experiment is an action whose outcome cannot be predicted with certainty. (True/False) 
An event is some specified result that may or may not occur when an experiment is performed. 
(True/False) 
The experiment has a finite number of possible outcomes, all equally likely. (True/False) 
The probability of an event equals the ratio of the number of ways that the event can occur to the 
total number possible outcomes. (True/False) 
If a member is selected at random from a finite population, probabilities are identical to percentages 
(relative frequencies). (True/False) 
The probability of an event is the proportion of times it occurs in a small number of repetitions of 
the experiment. (True/False) 
Sample space is the collection of all possible outcomes for an experiment. (True/False) 


An event is a collection of outcomes for the experiment, that is, any subset of the sample space. 
(True/False) 


Two or more events are mutually exclusive events if no three of them have outcomes in common. 
(True/False) 


If event A and event B are mutually exclusive, so are events A, B and C for every event C. 


(True/False) 

If event A and event B are not mutually exclusive, neither are events A, B and C for every event C. 

(True/False) 

If E is an event, then P(E) represents the probability that event E occurs. (True/False) 

If event A and event B are mutually exclusive, then P(A or B) = P(A) + P(B). (True/False) 

For any event E, P(E) = 1 — P(not E). (True/False) 

If A and B are any two events, then P(A or B) = P(A) + P(B)—- P(A & B). (True/False) 
The probability that event B occurs given that event A occurs is called a conditional probability. 

‘ (True/False) 

If A and B are any two events with P(A) > 0, then P(BIA) = P(A & B)/P(B). (True/False) 

If A and B are any two events, then P(A & B) = P(A)/P(BIA). (True/False) 

Event B is said to be independent of event A if P(BJA) = P(B). (True/False) 

If A and B are independent events, then P(A & B) = P(A)-P(B). (True/False) 


Two or more events are said to be mutually exclusive if at most one of them can occur when the 
experiment is performed, that is, if no two of them have outcomes in common. (True/False) 
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27. For any two events, the probability that one or the other of the events occurs equals the sum of 


the two individual probabilities. (True/False) 


28. For any event, the probability that it occurs equals | minus the probability that it does not occur. 


(True/False) 
29. Data obtained by observing values of one variable of a population are called univariate data. 

(True/False) 
30. Data obtained by observing values of two variables of a population are called bivariate data. 

(True/False) 
31. The joint probability equals the product of the marginal probabilities. (True/False) 


- ANSWERS TO STATE TRUE OR FALSE eS 


1. True 2. False 3. True 4. False 5.True 6.True 7. True 8. True 9. True 10. True 
11. False 12. True 13. True 14. False 15. Falsei6. True 17. True 18. True 19. True 20. True 
21. True 22. False 23. False 24. True 25. True 26. True 27. False 28. True 29. True 30. True 
31. True 


005 


CHAPTER 


Random Variables 
and Probability Distributions 


The topic of random variables is fundamental to probability and statistics. These concepts are natural 
extensions of the ideas of variables and relative frequency distributions. 


This chapter introduces the basic concepts and definitions of random variables (both discrete and 
continuous) and their mean and standard deviation, and probability distributions. Permutations and 
combinations are defined briefly. Discrete probability distributions (hypergeometric. binomial and Poisson 
distributions) and their mean and standard deviations are derived. Continuous probability distribution, 
namely, the normal distribution, its properties, mean and variance and the standard normal distribution are 
presented. Approximating probability is also presented. They include Binomial approximation to the 
hypergeometric, Poisson approximate to the binomial and normal approximation to the binomial and rule for 
distribution are presented. 


3.1 RANDOM VARIABLES | 


This section introduces the important concept of a probability distribution, which gives the probability for 
each value of a variable that is determined by chance. This section also introduces procedures for finding the 
mean and standard deviation for a probability distribution. A random variable is a variable (typically represented 
by x) that has a single numerical value, determined by chance, for each outcome of a procedure. In other 
words, a random variable is a quantitative variable whose value depends on chance. In Chapter |, we defined 
a variable as a characteristic that varies from one member of a population to another. When one or more 
members are selected at random from the population, the variable, in that context, is called a random variable. 
Therefore, mathematically, a random variable is a function defined on the outcome of the sample space. 


A probability distribution is a description that gives the probability for each value of the random 
variable. It is often expressed in the format of a graph, table, or formula. A random variable is classified a 
discrete or continuous, depending upon the range of its values. (The range of a random variable is the set 


of values it can assume.) 


A discrete random variable has either a finite number of values or a countable number of values. 
where countable refers to the fact that there might be infinitely many values. but they can be associated 
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with a counting process. Thus, we often say that a discrete random variable can assume at most a countable 


infinite number of values. 


The number of eggs that a hen lays in a day, the salary of a new college graduate, the number of 
bacteria in a culture, the count of the number of students present in statistics class on a given day, the 
number of cavities that a child has, the price of gold on the exchange on a particular day, toss of a coin, the 
number of cars sold at a car dealership in a given month, color of a ball drawn from a collection of balls, 
etc., these are all examples of discrete random variables. 


A continuous random variable has infinitely many values, and those values can be associated with 
measurements on a continuous scale without gaps or interruptions. 


The maximum daily temperature, the amount of milk a cow produces in one day, the life of an electric 
bulb, the measure of voltage for a particular smoke detector battery, the distance by which a sharp shooter 
misses a target, the height of a person, the weight of a person, and the length of time a person has to wait 
at a bank counter, diameter of a manufactured shaft, time to failure of a machine component, repair time, the 
price of an automobile, the time taken to complete a medical examination, duration of a snow storm, etc. 
provide examples of continuous random variables. 


3.1.1 Discrete Random Variables 


Recall from Chapter | that the relative-frequency distribution or relative-frequency histogram of a discrete 
variable gives the possible values of the variable and the proportion of times each value occurs. We can 
extend the actions of relative-frequency distribution and relative-frequency histogram-concepts applying 
to variables of finite populations — to any discrete random variable. Here, we use the terms probability 
distribution and probability histogram. 


The probability distribution of a discrete random variable, assuming a finite number of values, can be 
described by listing all the values that the random variable can assume, together with the corresponding 
probabilities. Such a listing is called the probability function of the random variable. 


Probability histogram is a graph of the probability distribution that displays the possible values of a 
discrete random variable on the horizontal axis and the probabilities of those values on the vertical axis. 
The probability of each value is represented by a vertical bar whose height equals the probability. 


In general, if a random variable X assumes the values x,, x,, ..., x,, then we can represent the probability 
that X takes the values x, by p,. That is 


Ep 
The probability function can then be summarised in the form of a table, as shown in Table 3.1. By 


definition, a probability function gives the probabilities with which the values are assumed by the random 
variable. Hence, 0 < p (x) <1. 


Table 3.1: The probability function of a random variable 
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Similarly, a random variable has to assume one of its possible values. Therefore, Zp(x) = 1. 
Properties of a Probability Function: 
1. The probability that a random variable assumes a value x, is always between 0 and 1. 
Hence, O0Sp(x)<1 Gan 
2. The sum of all probabilities p(x.) is equal to 1. 
Hence, 2p(x)=1 (3.2) 
Let X be a random variable. If the number of possible values of X is finite or countably infinite, X is 
called a discrete random variable. Each possible outcome is x, A number, p(x,) = p(X = x,), gives the 


probability that the random variable X equals a value, x,. The numbers, p(x), fori = 1, 2, ..., must satisfy the 
following conditions: 
le po) 20 for all i 
2G) =1 (3.3) 
i=] 
5 OS7R = | 


The collection of pairs [x,, p(x,)], 1 = 1, 2, ... is called the probability distribution of X and p(x) is called 
the probability mass function (or pmf) of X. The probability distribution for the variable X is characterised 
in Fig. 3.1. The cumulative distribution function of a discrete random variable is defined as 


p(x = X;)= > P(%;) (3.4) 


x,<X 


p(x) 


0 1 2 3 


Fig. 3.1: Probability distribution of a discrete random variable x, 
The function is a step function that is constant over every interval not containing any of the points x. 


as shown in Fig. 3.2. 
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Fig. 3.2: Cumulative distribution function (CDF) of a discrete random variable x, 


3.1.2 Mean and Standard Deviation of a Discrete Random Variable 
Consider a set of values, say [x], i= 1, 2, ..., 2, each of which occurs with frequency f for i= 1, 2, ..., k. The 


k k 
mean value is the sum of all the x’s, hence > x,-f, divided by the total number of x’s, } f, = 7, or 
i=l 


i=] 


y f, i=} n (3.5) 


ay 3.6) 


The relative frequencies f/n could be regarded as probabilities or values of f(x). Hence, 


If a random variable X has the discrete probability function f(x), we define its mean or expected value 
or expectation as 


E(X) = w= 2x + fix) or Xx - p(x) (3.7) 
where the summation extends over all the values of x and p(x) are the probabilities. 
The variance of a random variable X with probability distribution f(x) is given by 


CS 2) 7) (3.8) 


where the summation extends over all the values of x. 
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The positive square root of 6, o, is called the standard deviation. 
We often denote the variance of X by V(X). 
It is often easier to use the following formula for 07: 
o* = Six? f(x)—w? or )) x? p(x) —p? (3.9) 
The variance of a random variable X is a measure of the dispersion or scatter in the possible values for 


X. The variance of X, denoted as 6? or V(X) is 


o° =V(X) =) (x—p) f(x) 


(3.10) 
V(X) uses weight f(x) as the multiplier of each possible squared deviation (x — 1)’. 
Properties of summations and the definition of 1 can be used to show that 
V(X) = Ye py? f(x) = YF) - WY xe fMew Y fo 
= Vix f(x)-2y? +? = Fx? f)- pw? (3.11) 
Therefore, an alternative formula for V(X) can be used. 
Summarising, we have 
The mean or expected value of the discrete random variable X, denoted as or E(x) is 
SEC) 1G) (3.12) 
The variance of X, denoted as 6? or V(X) is 
2 = V(X) = E(X —py = Vay) f@) =) er fa-w (3.13) 


The standard deviation of X is 
o=/V(X) (3.13a) 


Example E3.1 


The number of defects in a gas turbine blade manufactured by a certain process varies from blade to blade. 
Overall, 50% of the blades produced have no defects, 40% have | defect, 8% have 2 defects and 2% have 3 
defects. Let x be the number of defects in a randomly selected turbine blade manufactured by the process. 

(a) find p(x = 0), p(x = 1), p(x = 2) and p(x = 3) 

(b) plot the probability mass function of the random variable x 

(c) find F(2) and F(1.5) 

(d) plot the cumulative distribution function F(x) of the random variable x that represent the number 

of defects in a randomly chosen turbine blade 
(e) population mean of the sample random variable, x 
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SOLUTION: 
(a) P(x=0)=0.50, P(x = 1) = 0.40, P(x = 2) = 0.08 and P(x = 3) = 0.02 
(b) The plot of the probability mass function of the random variable, x is shown in Fig. E3.1(a). 


(c) 


(d) 


0.6 
0.5 
0.4 
0.3 
0.2 
0.1 

0 


Probability 


0 1 2 3 
Number of defects 


Fig. E3.1(a) 

Since F(2) = P(x < 2), we need to find P(x < 2). We can do this by summing the probabilities for the 
value of x that are less than or equal to 2, namely 0, 1, and 2. Hence 
F(2) = PQ $2) = Pw=0) + Px=1)+ «= 2) =0.50 + 0.40 + 0.08 = 0.98 
Now F(1.5) = P(x < 1.5). Hence, to compute F(1.5), we must sum the probability for the values of x 
that are less than or equal to 1.5, which are 0 and 1. Therefore, 

Pi) =Fhe = 15)=pa—O4- Pa = 1) —050 +040 = 099 
First, we find F(x) for each of the possible values of x, which are 0, 1, 2, and 3. 
FO) = Fie 0) =i) 
F(1) = P< 1) =0.50 + 0.40 = 0.90 
F(2) = P(x < 2) = 0.50 + 0.40 + 0.08 = 0.98 
F(3) = Px $3) = 0.50 + 0.40 + 0.08 + 0.02 = 1 
The plot of F(x) is presented in Fig. E3.1(b). 


F(x) 


Fig. E3.1(b) 
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(e) The sample mean is the total number of defects divided by 100. 
a O(SO) + 1(40) + 2(8) +3(2) | 
. 100 7 
This can be written as 
mean = 0(0.50) + 1(0.40) + 2(0.08) + 3(0.02) = 0.62 


The above calculation for the mean shows that the mean of a perfect sample can be obtained by multiplying 
each possible value of x by its probability, and summing the products. which is the definition of the 
population mean of a discrete random variable. 


Example E3.2 


Table E3.2 gives the probability distribution of a discrete random variable x. 


0.62 


Table E3.2 
x 10 1 3 4 5 
0.30 | 0.20 | 0.12 
Find the following probabilities: 


(a) P(x=1), Px <1), Px <3) and PO <x<2) 
(b) probability that x assumes a value less than 3 


(c) probability that x assumes a value greater than 3 
(d) probability that x assumes a value in the interval 2 and 4. 


SOLUTION: 

(a) Be=1)=015 
Paes D)=A0) + PU) =003 + 0,13=0.16 
P(x <3) = PG) + P(4) + P(S) = 0.30 + 0.20 + 0.12 = 0.62 
PO <x=2)= PO) + P(1) + P(2) = 0.03 + 0.13 + 0.22 = 0.38 

(b) P(x<3)=P(O)+ P(1) + P(2) = 0.03 + 0.13 + 0.22 = 0.38 

(c) P(x>3)=P(4) + P(S) =0.20 + 0.12 =0.32 

(d) P(2< x<4)=P(2) + P(3) + P(4) =0.22 + 0.30 + 0.20 =0.72 


Table E3.3 lists the probability distribution of x. where x is the number of car accidents that occur in a city 


during a week. 
Table E3.3 


2 243 4 5 6 | 
0.12 0.06 | 


x 0) 
P(x) | 0.12 | 0.16 | 0.22 | 0.17 1 0.15 


(a) draw a line graph for the probability distribution 
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(b) find the probability that the number of car accidents that will occur during a given week in the city is 
(i) exactly 4 (ii) ~ at least 3 (iii) less than 3 (iv) 3to5 
SOLUTION: 
(a) The line graph is shown in Fig. E3.3. 


0.25 
0.2 


0 1 2 6} 4 5 
Number of car accidents 


Fig. E3.3 
(b) (i) P(exactly 4) = P(4) =0.15 
(ii) P(at least 3) = P(x = 3) = P(3) + P(4) + P(5) + P(6) = 0.17 + 0.15 + 0.12 + 0.06 = 0.50 
(iii) P(less than 3) = P(x < 3) = P(O) + P(1) + P(2) = 0.12 + 0.16 + 0.22 = 0.50 
(iv) P(3to5)=P(3) + P(4) + POS) =0.17+0.15 +0.12 =0.44 


Example E3.4 


According to a survey, 75% of all elementary school teachers in a city in the year 2005 were women. Assume 
that this result holds true for the current population of all teachers in that city. Suppose 2 teachers are 
randomly selected from the population of all teachers in that city. Denoting x as the number of women in 
that sample, construct the probability distribution table of x. Draw a tree diagram for this problem. 


SOLUTION: 
Let M = teacher selected is a man 
W = teacher selected is a woman 
Then P(W)=0.75 and P(M) = 1—P(W) =1-0.75 =0.25 
The tree diagram is shown in Fig. E3.4. 


Second teacher 


First teacher 


P(WW) = 0.75(0.75) = 0.5625 


0.75 
P(WM) = 0.75(0.25) = 0.1875 


P(MW) = 0.25(0.75) = 0.1875 
P(MM) = 0.25(0.25) = 0.0625 


Fig. E3.4 


Now, x= the number of women in a sample of 2 teachers. Table E3.4 lists the probability distribution of x, 
where x = 0 represents if neither teacher is a woman, x = 1 if the teacher is a woman and one is a man, and 
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x= 2 if the both teachers are woman. The probability in the table are obtained from the tree diagram in Fig. E3.4. 
The probability of x = 1 is obtained by adding the probabilities of WM and MW. 


Table E3.4 


Outcomes vi 
MM 0 
1 
2, 


WM or MW 
Ww 


Example E3.5 


A machine design class has 20 students and 11 of them are female. Suppose 2 students are randomly selected 
from the class and x denotes the number of females in the sample. 


(a) draw a tree diagram 
(b) find the probability distribution of «x. 
Assume here that the draws are made without replacement from a small population and the probabilities of 


outcomes do not remain constant for each draw. 


SOLUTION: 


Let A = first student selected is a male 
B = first student selected is a female 
C = second student selected is a male 
D = second student selected is a female 
(a) The tree diagram is shown in Fig. E3.5. 


First student Second student 


CIA P(AC) = 


— 
11/19 


P(BC) = 


[ 
paoy= 2( 1 

( 

( 


4019 P(BD)= 


Fig. 3.5 


(b) Table E3.5 lists the probability distribution of x, where x is the number of females in a sample of 
2 students. 


Table E3.5 


[Outcomes | x _ 


: 0.1895 


AD or BC 
; 0. 2895 
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Example E3.6 


Table E3.6 lists the probability distribution of the number of breakdowns per week for a machine based on 
past data. 


Table E3.6 


[Breakdowns per week[ 0 [| 1 | 2 | 3 | 
0.15 | 0.25 | 0.40 


(a) present this probability distribution graphically 
(b) find the probability that the number of breakdowns for this machine during a given week is 
(i) exactly 2 (ii) Oto2 (iii) more than 1 (iv) at most 1. 


SOLUTION: 


Let x denotes the number of breakdowns for this machine during a given week. The Table E3.6 lists the 
probability distribution of x. 


Table E3.6: Probability distribution of breakdowns 


0.40 


Fig. E3.6: Graphical representation of the probability distribution 
(b) (i) The probability of exactly, two breakdowns is 
P(exactly 2 breakdowns) = P(x = 2) = 0.40 


(ti) The probability of 0 to 2 breakdowns is given by the sum of the probabilities of 0, 1 and 2 
breakdowns 


P(0 to 2 breakdowns) = P(O< x < 2) 
= P(x=0)+ P= 1) + Pw=2)=0.15 + 0.25 + 0.40 = 0.80 


(iit) The probability of more than 1 breakdown is obtained by adding the probabilities of 2 and 3 
breakdowns 


P(more than 1 breakdown) = P(x > 1) 
= Px =2) + P(x=3) =0.40 + 0.20 = 0.60 
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(iv) The probability of at most 1 breakdown is given by the sum of the probabilities of 0 and 1 
breakdowns. 


P(at most 1 breakdown) = P(x < 1) 
= P(x=0)+ P&=1)=0.15 + 0.25 = 0.40 


Example E3.7 


Table E3.7 lists the probability distribution of x, where x is the number of defects contained in a randomly 
selected manufactured product. Find the mean and standard deviation of x. 


Table E3.7 


Bh 
0.12 | 0.05 | 0.03 | 0.01 


SOLUTION: 
Refer to Table E3.7(a) 


Table E3.7(a) 


Mean uw = XxP(x) = 0.35 defects (from Table E3.7(a)). 


Standard deviation 6 = x?P(x)— 2 = V0.75-0.357 = 0.79 defects. 


Example E3.8 
Find the mean, the variance, and the standard deviation of x, where x denotes the number that shows up 


when a fair die is rolled. 


SOLUTION: 


The probability function of x and the necessary calculations are summarised in Table E3.8. 
Table E3.8 


Px[p@] xp | xp) 


* 
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The mean given by 
pai 
b= Xxp(x)= m =) 


Variance is given by 
Ooi. 
2 2 2 
O° = Xv p(x)-u = — -| —] =2.92 
a) p 6 (=) 


The standard deviation = ./g2 =./2.92 = 1.71 


Hence, mean = 3.5, variance = 2.92 and standard deviation = 1.71. 


3.1.3 Continuous Random Variables 


A random variable is continuous if its probabilities are given by areas under a curve. The curve is called a 
probability density function for the random variable. The probability density function is sometimes called 
the probability distribution. 


If the range space of the random variable X is an interval or a collection of intervals, X is called a 
continuous random variable. If a and b are real numbers and a < b, then the probability that X lies in the 
interval [a, b] is defined as 


b 
Plas X<bj= | f(x) dx G.14) 


The function f(x) is known as the probability density function (or pdf ) of the random variable X. The 
pdf must satisfy the following conditions: 


Ler 0, —c<X<oco 
2 Jf@)dr=1 (3.15) 


The probability density function (pdf) is shown in Fig. 3.3. The shaded area in Fig. 3.3 represents the 
probability that X lies in the interval [a, b]. 


From Eq. (3.14), for any specified value of x, P(X = x) = 0, since 


Jf@a=0 (3.16) 
Hence, we can also write 
PiasX<b)=Pa<X<b)=P(asX<b)=P(a<X <b) GIy 
Since x is continuous 
F(x) = | fat (3.18) 


The following conditions are satisfied 


1. F is a non-decreasing function. If a < b, then F(a) < F(b). 


9 lim F(x)=1 
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li 0 
er) =0 (3.19) 


All probability questions about X can be expressed in terms of the cdf. For instance 
P(a< X<b)= F(b) — F(a) fora<b (3.20) 


Probability density 


Fig. 3.3: The probability density function (pdf) 
(The shaded area under the curve is P(a < X < b)) 
The cumulative distribution function of a continuous random variable X is F(X) = P(X < x), just like for 
a discrete random variable. For a continuous variable, the value of F(x) is obtained by integrating the 
probability density function. 


Since F(x) = P(X Sx), we have 


F(x)= [ f@at (3.21) 


where f(t) is the probability density function. 
Hence, the continuous distribution function (cdf) of X is the function 


ih 


F(x)= P(X Sx)= f fat (3.22) 
3.1.4 Mean and Variance for Continuous Random Variables 
The population mean and variance of a continuous random variable are defined in the same way as those 
of a discrete random variable, except that the probability density function is used instead of the probability 
mass function. Specifically, if X is a continuous random variable, its population mean is defined to be the 
center of mass of its probability density function and its population variance is the moment of inertia around 


a vertical axis through the population mean. 
Let X be a continuous random variable with probability density function f(x). Then, the mean is given by 


co 


bw, = f x f(x) dx (3.23) 


oo 


The mean of X is sometimes called the expectation or the expected value of X and denoted by E(X) 


or by pL. 
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The variance of X is given by 


oy = | Gp.) fo) de 


An alternative formula for the variance is given by 


co 


ox = | x f(x) de-py 


—oo 


The variance of X is also denoted by V(X) or by 0”. 


The standard deviation is the square root of the variance 


oy = 0% 


Alternate Formula for the Variance 
Starting with the equation 


c= i, POs 
Expanding the above equation, we obtain 


Ox = D(x? -2xny + Hy) P(X =x) 


or o = ix ex = x)-2xpy P(X = x)+p3 P(X = 


x 


Summing the terms separately, we have 
Ox = x P(X =x)- Amy P(X =x)+ Dw P(X = 3) 
x x x 
Noting that 


>, 2xty P(X = x)= py >) xP(X = x) = Wghy = 2% 


and Dg Re =x) = ee a) See 
Substituting Eqs. (3.31) and (3.32) in Eq. (3.30), we get 


Oy = x P(X =x)—2u2 +2 


x 


or Oy = 4 PO =x) 


3.1.5 Expectation 


A brief summary is given here: 


(3.24) 


(3-25) 


(3.26) 


B27) 


(3.28) 


(3.29) 


(3.30) 


G35) 


(3.32) 


G35) 


(3.34) 
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If X is a random variable, the expected value of X, denoted by E(X), for discrete and continuous variable 
is defined as 


EQ) > xp) if X is discrete (3.35) 


alli 


and EA [ee (x) dx if X is continuous (3.36) 


the expected value E(X) of a random variable X is also referred to as the mean of X, Hl, (Or 1). or the first 
moment of X. The quantity, E(X"), n > 1, is called the n” moment of X and is defined as 


E(X”)= > x; p(x;) if X is discrete (3.37) 
alli 


and Og = ie x" f(x)dx if X is continuous (3.38) 


The variance of a random variable, X, denoted by V(X), Var(X), or 0”, is defined by 

VX) = E(X— E(X)P) 
A useful identity in computing V(X) is given by 

V(X) = E(X’) — [ECX)P (3.39) 
The mean £(X) is a measure of the central tendency of a random variable. The variance, V(X), is a 


measure of the spread or dispersion of the possible values of X around the mean E(X). The standard 
deviation, 6, is the square root of the variance. 


Example E3.9 


The life of an electronic component is given by X, a continuous random variable assuming all values in the 
range x = 0. The pdf of the lifetime, in years is given as 

1 a 
—e*, x20 


FO) =) 4 


0, otherwise 


(a) determine the probability that the life of this component is between 2 and 4 years. 
(b) determine cdf of the components 
(c) determine the probability that this electronic component will last for 4 years 


(d) find the mean and variance of the life of the component. 


SOLUTION: 


ameiod 1 
(a) Pasx<4)=—fe 4dx=—e'!+e 2 =0.2386 
® 


The probability that the laser will last between 2 and 4 years is obtained also from 
P(2<X <4)=F(4)- F2)=(1-e) -(l-e™) =e" -e ' = 0.6065 — 0.3679 = 0.2386 


94 // Probability and Statistics for Scientists and Engineers || 


(b) 


(c) 


(d) 


The cdf of this component is given by 
pea = 
F(x) =—[e 4dx= le 
4% 
The probability that the component will last for 4 years or less is given by 
PO<X<4)= F(4)-FO0) = F4) =1-e = 0.6321 
The mean and variance of the life of the electronic component are determined as follows: 


for} °° 


) _* 1 a 
+fe 4dx=0+—~e 4 
, 1/4 


E(X) = = {xe ‘as =xe 4 
0 


= 4 years 


0 
To compute V(X), first compute E(X°) as follows: 
ay 1 t 2») -= a) -= i: 
E(X*)=—[xe 4dx=xe 4 
4% 


+8[ xe 4dx = 32 
0 0 


Hence, variance is given by 
V(X) = 32 — 4 = 16 years 


Example E3.10 


Given f(x) = e* for 0 < x. Find the following probabilities: 


(a) 
(b) 
(c) 
(d) 
(e) 


P= xX) 
IAQ eS) 
PX =3) 
P(X <4) 
P3SX) 


SOLUTION: 


(a) 


(b) 


(c) 


(d) 


co 


P< X)=[e"dx=(-e*)| =e = 0.3679 


1 


1 


2.5 oa 


P< X <25)= | e*dx=(~e*) 


1 


=e !-e* =0.2858 


| 
5) 

P(X =3)=[e“dx=0 
3 


4 


4 
P(X <4)=[ ede =(-e*)| =1-e4 =0.9817 
0 


0 
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[] 


(e) PGSX)=[e*de=(-e*)| =e? =0,0498 
3 


3 


Example E3.11 


The probability density of the continuous random variable X is given by 


? 


WS for2<x<7 
i, 


elsewhere 


Find (a) 7PG<X <5) 


(b) area under the curve. 


SOLUTION: 


aa ile 1 2 
(a) P@<X<5)=[=dx=—[idx=—6-3)== 
15) ce 5 5 


a 7 
(b) Area under the curve = J fojdr= Je de= x 


Example E3.12 


Given f(x) = 1.5x” for —1 <x < 1. Find the mean and variance of x. 


SOLUTION: 
1 1 oA l 
E(X) =f x(1.5x?)de =f 1.5x°dx= We) = 


=i -l 


=| 


5 1 


i J 
V(X)= f15x?(x-0)' de =1.5 | x*dx = 15” 
= =) 


=I 


Example E3.13 


The pdf of the random variable X is given by 


ae for0O<x<4 
= oor 


0 elsewhere 


Find (a) the value of a 


(b) fx<i| and P(X > 1). 
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SOLUTION: 


4 172 |4 
x 
Area=1= | —=dx=al x ?dx=a—| =2a2=4a 
Grae I+ le We). 
Hence, a= 1/4. 
1/4 4 1/4 
1 ifo1 
(b) p(x <4}- 1 pel | reals js 
4) 4x 44 41/2) 22 4 
l 1 1 
TAO = i) = lor =ds=1- Va — 
04 0 2 
Example E3.14 


The thickness of a conductive coating in micrometers has a density function of f(x) 
< 120 um. 


600 x? for 100 um < x 
(a) find the mean and variance of the coating thickness 
(b) 1 


if the cost of coating is $0.25 per micrometer of thickness on each component, find the average 
cost of the coating per component. 
SOLUTION: 


120 600 120 
(a) E(X)= | 


4—, dk = = 600I1n x 
Xe 


= 109.39 
100 


100 
120 


V(X)= J (x-109.39) Ses 600 f 1- 


2(109.39) (a 09.39)" 
100 100 x ze 


120 
= 600(x—218.75In x~109.39° x") = 33.19 


(b) Average cost per component = $0.25(109.39) = $27.35 


Example E3.15 
The distribution function of the random variable X is given by 


FO ee for x >0 
0 for x <0 
Find fa) Pixs?) 
(Db) Pu <3) 
(c) P(X>4) 
SOLUTION: 
(a) P(X $2) = F(2) = 1 — 3e? = 1 — 3(0.1353) = 1 — 0.4074 = 0.5926 
(b) - = 
(c) 


PI < X <3) = F(3) -— F(l) = 1 — 4e? — 1 + 2e7 = 2e" — 4e> = 203679) 4(0.0498) = 0.5366 
P(X > 4) = 1 ~ F(4) = Se* = 5(0.0183) = 0.0915 
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Example E3.16 


The pdf of a random variable X is given by 


4 3 
joan? forO<x<] 
0 elsewhere 


Sketch the pdf and find the values of the following: 


(a) o{x<3] 
yy 
1 3 


1 
(c) of x 3) 


SOLUTION: 
The plot is shown in Fig. E3.16. 4/3 


esr dx 
(a) p(x <i]= [ 4a-29)= = 0.6458 
a 3 
3/4 
| 4a)" = 0.5605 
1/4 3 


(b) p(i<x<2)- 


1 0 Xx 
(c) (x >3\= | 4-0) = 0.5597 1 
a Fig. E3.16 pdf of the function f(x) 

Example E3.17 

The pdf of a random variable X is given by 

ax” forO<x<2 
xX = 
© F elsewhere 


(a) find the value of the constant a and sketch the pdf 
(b) find P(X > 3/2). 


SOLUTION: 


(a) We must have 


co 2 7 1] 
grenades ara! 
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Hence, a = 3/7. The pdf is shown in Fig. E3.17. 


Fig. E3.17 


(b) P(X >3/2)= | f(x) dx= | = x°dx = 37/56 


3/2 3/2 


3.2 PERMUTATIONS AND COMBINATIONS —~ - 


Factorial Notation: The product of the positive integers from | to n inclusive is denoted by n! (read as “n 
factorial”). The value of the factorial of a number is obtained by multiplying all integers from that number 
iol: 


The symbol n!, read as “n factorial” represents the product of all integers from n to 1. In other words, 


nt =n(n— 1) (n—2) (n—3) +++ 3-2-1 (3.40) 
It is also convenient to define 

OFS ele lvand 77: (Gy — W)!) (3.41) 
Example E3.18 


Evaluate the following: 
(a) 2!,3!, 4!, 5!, 6! and 7! 
(b) (7-3)!, (14 — 12)!, 7 —2)!, 13 —4)!, and (6 — 6)! 


8! 12! 2H 
(c) 6!’ OF and 9731 
SOLUTION: 
(a) 2F=21=2 
= 3-2 ='6 
a= 432 a 24 


We S3 2-1 = 120 

6! = 6:5-4-3-2-1 = 720 

7! = 7:6'5-4-3-2:1 = 5040 
(b) (7-3)'=4!=43-2-1=24 

(14—-12)!=2!=2.1=2 


/! Random Variables and Probability Distributions // 99 


(7—2)l= 5! =5-4-3-2-1 = 120 
(13 — 4)! = 9! = 9-8-7-6:5-4-3-.2-1 = 362,880 


(6-6)!=0!=1 
_ 8! 8-7-6! 
(c) B= =87=56 
12! _ 1211-109! 
Siem tL t= 1320 
‘12! 12-11-1091 12-11-10 
SCiameeam 32  ~-~ 
Example E3.19 


Find the value of 9! by using the table in Appendix-A. 


SOLUTION: 


We locate 9 in the column labeled n. Then we read the value in the column for n! entered next to 9. Hence. 
9! = 362, 880 


3.2.1 Permutations 


Any arrangement of a set of n objects in a given order is called a permutation of the objects (taken all at a 
time). Any arrangement of any r <n of those objects in a given order is called an r-permutation or a 
permutation of the n objects taken r at a time. 


Ordered arrangement of a set of objects are called permutations. When the order in which they are 
arranged is disregarded, then the arrangements are known as combinations. The number of permutations 
of n different items is the number of different arrangements in which these items can be placed. A permutation 
of n different objects taken r at a time is an arrangement of r out of the n objects with attention given to the 
order of arrangement. The number of permutations of n objects taken r at a time is denoted by nP, 


P(n, r) P”, (n), or P_ and is written as 


! 


(n-r)! 


nP. = n(n—1)(n—-2)---(n-r+]1)= (3.42) 


Derivation of the Formula nP. 

The first element in an r-permutation of n objects can be chosen in n different ways. Following this way. the 
second element in the permutation can be chosen inn ~ | ways, and the 3 element in the permutation can be 
chosen in n — 2 ways etc. Finally, the r” (last) element in the r-permutation can be chosen in 
n—(r—1)=n—r+1 ways. Hence, from the fundamental principle of counting, we have 


nP_ = P(n, r)=n(n—1)(n—-2) ++ n-r+ 1) (3.43) 
n(n-1)(n-2)-(n-r+)(n—-r)! in! G.A4) 


or = n(n—1)(n- 2) (-r+ = (1—Dyeeee Onn! 


— jo 
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in which n! is the symbol for factorial n = n(n — 1)-- 3x 2x 1, whenr=n,n P =n! (note by definition 
0! = 1). Tables of the values of factorial n are given in Appendix-A. 

The permutation of n items taken all at a time, when the n items consists of r, alike, r, alike, ... 7, alike, 
so that by definition r, +7, + °°: +7, =n. 


The number of permutation is then 


! 
n! (3.45) 


Rly br, | 


Also, we have nP_ = n(n — 1) +: 3-2:1 =n! 


Example E3.20 
How many five-digit numbers can be formed from the numbers 1 to 8? 
SOLUTION: 
Here v= 8 and7=5. 
cn ne I AO SEED Sea 
" (n=r)! (8-5)! 32 xa 


Example E3.21 


How many numbers of permutations of letters are there in the word quality? 


SOLUTION: 


Here n=7 (7 letters in word quality) 


r, = 1(q) 
r, = 1) 
r,= l(a) 
r,=10 
r,=1@) 
r= 10 
r,=1(y) 
Therefore, 
n!} Th 


=7xX6x5xX4x3x2x1= 5040 


Rly bony WEE d! 


Example E3.22 
ee 


(a) Evaluate iE Gn IGE 3P, and 6P, 


(b) five persons arrive at the bank teller window at the same time. In how many different ways can 
these people line up? 


(c) find the number of ways of arranging the letters in the word subject 
(d) find the number of ways if each arrangement in part (c) starts with the letter j. 
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SOLUTION: 
(a) 7P,=765=210 
6P, = 6:5:4-3 = 360 


16P, = 16 
3P, = 32:1 =6 
6P, = 65-4 = 120 


(b) These paper can line up in different ways 
5! = 5-4-3-2-1 = 120 ways. 

(c) There are 7 distinct letters in the word subject. Hence, the number of arrangements is 7!. That is 
7! =7:65-4-3-2:1 = 5040 

(d) Since the first letter is fixed as j, the total number of possibilities are found by arranging the 
remaining six letters in all possible ways. This can be done by 6! ways. 
6! = 6°5-4-3-2-1 = 720 ways. 

3.2.2 Combinations 


As noted earlier, the order in which objects are arranged matters in permutations. However, order does not 
matter in combinations. 


The number of combinations of n different objects taken r at a time is a selection of r out of the 1 
objects with no attention given to the order of arrangement. The number of combinations of objects 


, n\, ; : 
taken r at a time is denoted by nC, C(n, r), C, . or (*) is r! times smaller than the number of permutations. 
r 


Therefore, 


C= nm) nF n!} 
erly} orl (n-nir! (3.46) 


It should be noted from symmetry that the identity is valid 
nC_=nC (3.47) 


n\ nP, 

Also =— 
r ip 

The arrangement for dividing the number of permutations by r! to get the number of combinations is 
that each combination give rise to r! possibilities when arranged in all possible ways, thereby giving all the 


permutations. 


Tables of binomial coefficients {*} are given in Appendix-B. 
iz 


Note that in combinations, n is always greater than or equal to r. If 7 is smaller than r, then we cannot 


select 7 distinct elements from n. 
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Example E3.23 


EX@MPI€ BS. 


(a) Evaluate the following: 


(3-6-0) 6) 


(b) Four cards are chosen in succession from a deck of 52 cards. Find the number of ways this can be 
done (i) with replacement, (ii) without replacement. 
(c) find the number m of groups of 3 that can be formed from 8 students in a class. 


SOLUTION: 


(a) 


Cio! ae 
6) 6'(6—6)! O! 9] 


| 
| 
C 9): See 2a) 
| 
| 


| @=3)G) Gaines oo Te 


4! 4! 


04-0)! 1-4! 


>! 5103.43.21 
—_—_—— = =10 


= eee IS 
(15—2)!2! 13-12----2-1-2-1 


20! 


& = 125,970 
8 } (20-8)!8! ~~’ 


| 
| 

bl 1S! 1514-13-21 
| 


(b) (i) Since each card is replaced before the next card is chosen, each card can be chosen by 
52 ways. Hence, there are 


(52)(52)(52)(52) = 52* = 7,311,616 
different ordered samples of size n = 4 with replacement. 


(ii) Since there is no replacement, the first card can be chosen in 52 ways, the second card in 51 
ways, 3 card in 50 ways and the last card in 49 ways. Thus there are 


P(52, 4) = 52(51)(50)(49) = 6,497,400 
different ordered samples of size r= 4 without replacement. 


(c) m=c4a.ry=[5 Ja STS a5 
3 3-2-1 


Since each group is essentially a combination of 8 students taken 3 at a time. 
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Example E3.24 


The president of a university has 12 departments under his administration. The president plans to visit 3 of 
these departments during the next week. If he randomly selects 3 departments from these 12, how many 
total selections are possible? Verify the answer using the table in Appendix-B. 


SOLUTION: 


The total possible selections for selecting 3 departments from 12 are 


12 ea 
= —_—____ = 220 
3} (12-3)!3! 


From the table in Appendix-B, we read for n = 12 and r = 3, the answer is 220. 


Example E3.25 


Out of 5 men and 7 women, a committee consisting of 2 men and 3 women is to be formed. In how many 
ways can this be done if 

(a) any men and any women can be included 

(b) one particular women must be included on the committee 


(c) two particular men cannot be included on the committee. 


SOLUTION: 

(a) 2 men out of 5 can be selected in 5C, ways. 

3 women out of 7 can be selected in 7C, ways. 

Hence, the number of possible selections = (SC,) (7C,) = (10) (35) = 350 
(b) 2 men out of 5 can be selected SC, ways. 

2 additional women out of 6 can be selected in 6 C, ways. 

Hence, the number of possible selections = (5C,) (6C,) = (10) (15) = 150 
(c) 2 men out of 3 can be selected 3C, ways. 

3 women out of 7 can be selected in 7C, ways. 

Hence, the number of possible selections = (3C,) (7C,) = (3) (Go 05 


3.3 DISCRETE DISTRIBUTIONS : FT 


Discrete random variables are used to describe random phenomena in which only integer values can occur. 


3.3.1 Hypergeometric Distribution 

In this section we consider dependent Bernoulli random variables. A common source of dependent Bernoulli 
random variables is sampling without replacement from a finite population. Suppose that a finite population 
consists of a known number of successes and failures. If we sample a fixed number of units from that 
population, the number of successes in our sample will have a distribution that is a member of the family of 


hypergeometric distributions. 
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Definition of the Hypergeometric Distribution 
A set of N objects contains K objects classified as successes and N — K objects classified as failures. A 
sample of size n objects is selected randomly (without replacement) from the N objects, where K < N and 
nN. 

Let the random variable X denotes the number of successes in the sample. Then X is a hypergeometric 


random variable and 


ea 
i 


The expression {K, 1} is used in the definition of the range of X because the maximum number of 
successes that can occur in the sample is the smaller of the sample size, n, and the number of successes 
available, K. 


x =max{0,n + K—N} to min{K, n} (3.48) 


Also, if n + K > N, at least n + K — N successes must occur in the sample. 
It should be noted here that in Eq. (3.48) 


a\_ a! 
bs ~ bWa—b)! (3.49) 


is the number of a parts taken b at a time. The hypergeometric distribution is the appropriate possibility 
model for sampling without replacement. Other distributions are often employed when the ratio of n/N 
becomes small, say 0.10 or less. 


The mean and variance of a hypergoemetric random variable can be determined from the trials that 
comprise the experiment. However, the trials are not independent, and so the calculations are more difficult 
than for a binomial distribution. The results are stated as follows: 

Mean and Variance of a Hypergeometric Distribution 


If X is a hypergoemetric random variable with parameters N, K, and n, then the mean 


ww = E(X) = np (3.49a) 
and the variance 
2. - N-n 
o° =V(X)=np(l- p| Ne (3.49b) 
K 
where a 3.49c 
p=* (3.49) 


Here, p is interpreted as the proportion of successes in the set of N objects. For a hypergeometric 
random variable, E(X) is similar to the mean of a binomial random variable. Also, V(X) differs from the result 
for a binomial random variable only by the term shown below: 


The term in the variance of a hypergeometric random variable Nis 


; is called the finite population 


correlation factor. 
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Example E3.26 


Given that X has a hypergeometric distribution with N = 100, n = 4 and K = 20. Determine the following: 
(a) P(X=1) 
(b) P(X=6) 

(c) PX=4) 
(d) the mean and variance of X. 


SOLUTION: 
Here, K = 20, X = 1, N= 100 andn = 4. 


oO 


4 
(b) P(X =6)=0, since the sample size is only 4. 


K\iN-K 20 \f 80 
n—-X 
(c) P(X=4)= ea Lal). a Scala 
100 3921225 
n 4 


K 20 
dad) Mean 2(X)=np = nj —)}=4| —_|=0. 
(d) Mean = E(X) =np (<] (| 0.8 


(a) P(X=1)= = 0.4191 


Variance = V(X ) = np(1 — p) (s=*)- 40(0.2)(0. | = - 0.6206 


Example E3.27 
Suppose a box contains five red balls and ten blue balls. If seven balls are selected at random without 
replacement, find the probability that at least 3 red balls will be obtained. 


SOLUTION: 
Let X denotes the number of red balls that are obtained. Then, X has a hypergeometric distribution, D = 5, 
A+B=15,n=7, B= 10and A =5. The maximum value of X is min{n, A} = 5. Hence, 
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Example E3.28 


A lot of 75 gaskets contains five in which the variability in thickness around the circumference of the gasket 
is unacceptable. A sample of 10 gaskets is selected at random, without replacement. What is the probability 


that 


(a) 
(b) 
(c) 
(d) 


none of the unacceptable gaskets is in the sample 

at least one unacceptable gasket is in the sample 
exactly one unacceptable gasket is in the sample 

the mean number of unacceptable gaskets in the sample. 


SOLUTION: 


Let X denotes the number of unacceptable gaskets in the sample of 10. 


(eS ( 
il 
(79 " 
0/\10 10!60! _ 65x64x63x62x61 
-) = =e eee oe 
(a) X= 0) = sya mis = 9574579 : 
10} 10165! 
(b) P(X>1)=1-P(X=0)=1-0.4786 = 0.5214 
nies 5170! 
I\9 ) “ow:  5x65x64x 63x 62x10 
c) PX=0)= ———_—_— = + SS = 0.3923 
ee) Sia 75! 75x 7AX73X72X71 : 
10) 10165! 
(d) ax)= 10 =) =2 
Fj. 3 


Example E3.29 
SE 


Of 50 manufactured steel rods in a production process by a company, 12 have defects. If 10 steel rods are 
selected at random for inspection, 


(a) find the probability that exactly 3 of the 10 have defects 
(b) find the mean and variance of X. 


SOLUTION: 


(a) Let X represents the number of sampled steel rods that have defects. Then, K = 12, N = 50 and 
nh=10: 
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Sy Ora LOT 


Le eal 
x nx 3. J\7_} _ (220)(12,620,256) _ 


PX=3)= —— oS = = 02703 


N 50 10,272, 278,170 
n 10 
(b) Mente =) oy 
50 
h 1 
WwW. ee 
ere DP 50 
N-n 12 12 \/ SO=—10 
Nari = 0 =np(l- )-1o{ 21-2) | =1.4800 
Ea a | oem 50) 50)\ 50-1 
Example E3.30 


A large bin contains 80 balls of which 32 are red balls and 48 are blue balls. Suppose 15 balls are picked at 
random. Find the probability of getting 4 red balls, the mean number of red balls, and the standard deviation 
of the number of red balls if the sample is picked 


(a) with replacement 
(b) without replacement. 


SOLUTION: 
We will identify success with “picking a red ball” and let X = number of red balls in the sample. 


(a) Ifthe sampling is with replacement, we have a binomial distribution with 15 trials and probability 


32 
of success p = 30 ~ 0.4. Hence, 
P(X = 4) = Binomial(4; 15, 0.4) = 0.127 (from binomial distribution in Appendix-C) 
pw =np = 15(0.4) 


o = Jnp(1— p) = ¥15(0.4)(0.6) =1.9 


(b) If the sampling is without replacement, we have a hypergeometric distribution and 


) Gs 


Note that this answer is not much different from the one obtained in part (a) using the binomial formula. 
The reason is that N (= 80), compared to n (= 15), is fairly large. 


Mean, w=np = 15(0.4) =6 
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Standard deviation 


Nn 32 ae 
2 2 Bhs 2 |( eee =1.72 
Set P)| =" (=) 80) | 80-1 


3.3.2 The Binomial Probability Distribution 


The binomial probability distribution is one of the most widely used discrete probability distribution. It is 
used to find the probability that an outcome will occur x times in n performances of an experiment. The 
binomial distribution is applied to experiments that satisfy the four conditions of a binomial experiment. 
Each repetition of a binomial experiment is called a trial or a Bernoulli trial (after Jacob Bernoulli). 


A trial with only two possible outcomes is used so frequently as a building block of a random experiment 
that is called a Bernoulli trial. It is usually assumed that the trials that constitute the random experiment 
are independent. This implies that the outcome from one trial has no effect on the outcome to be obtained 
from any other trial. In addition, it is often reasonable to assume that the probability of a success in each 
trial is constant. 


3.3.3 The Binomial Experiment 
An experiment that satisfies the following four conditions is called a binomial experiment. 


1. There are n identical trials. In other words, the given experiment is repeated n times. All these 
repetitions are performed under identical conditions. 

2. Each trial has two and only two outcomes. These outcomes are usually called a success and a 
failure. 

3. The probability of success is denoted by p and that of failure by g, and p + q = 1. The probabilities 
p and q remain constant for each trial. 


4. The trials are independent. In other words, the outcome of one trial does not affect the outcome of 
another trial. 


One of the two outcomes of a trial is called a success and the other a failure. 


The random variable X that equals the number of trials that result in a success has a binomial random 
variable with parameters 0 < p< 1 andn = 1, 2, .... The probability mass function (pmf) of X is 


n x i 
roo=["lp (ey), oa OO er, (3.50) 


3.3.4 The Binomial Formula 


3.3.4.1 Binomial Theorem 


The n" power of (p + g) can be expressed in terms of binomial coefficients as 


S =1)= n! 
+q)" = p"+np" , n=) BOF 55, ee at Ge ee 
(p+q) =p’ +np"q Dy Pk aie ae ee 


| 2 ie fe ~ n-ror 
= > J =) (Grae (3.50a) 
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Since (p + q) = (q + p), we can also write 


(po) = Geng’ (3.51) 


r=0 
Consider n trials in each of which the probability of success is p. Then the probability of failure is 
| — p =q. To find the probability of r successes, we observe that: 
the probability of 1 success in 1 try is p, 
the probability of 2 successes in 2 tries is p x p or p’, 
the probability of 3 successes in 3 tries is p x p x p or p’, 


the probability of r successes in r tries is p" and 
the probability of subsequent (n — r) failures in (n — r) tries is (1 — p)"” = q’. 

The probability of r successes followed by (n — r) failures is p’(1 — p)"". Here, we have considered only 
one particular group or combination of r events; 1.e., we have started with r successes and finished with 
(n — r) failures; every other possible ordering of r successes and (n — r) failures will also have the same 
probability. 

The number of possible orderings or the number of selections for r successes and (1 — r) failures inn 
trials is n!/[r!(n — r)!]. 


Therefore, the probability P_ of an event succeeding r times is 


n! 


P= j= n-r 
ST eey d=) (3.52) 
or P= C p'q"" (3.33) 
This term is similar to the r” term of the binomial expansion (q + p)" [see Eq. (3.51)]. which can be 
written 
+p" =>,.C.2a"" (3.54) 
r=0 


The successive terms of the expansion give the probability P_ of an event succeeding r times in 7 trials 
for values of r varying in steps of one of 0 to n. 


3.3.4.2 Cumulative Terms for Binomial Distribution 
The probability of an event succeeding at least r times in n trials. 
The probability P_ of an event succeeding exactly r times in n trials is given by Eq. (3.53) 
P= Cpq" 


The probability of an event succeeding at least r’ times in 7 trials is given by 


Pa 7 Crug” (3.55) 
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The values of the summation of Eq. (3.55) are given in Appendix-D for p ranging from 0.05 to 0.50; with 
n between 2 and 20, and 7’ between | and 20. 
For p > 0.5, we can utilise the fact that the probability is 


rn 


- ¥ S610 lf a ip (3.56) 


r=n-r’+1 


3.3.4.3 Mean and Standard Deviation of Binomial Distribution 
If p is the proportion of successes in the population, then the mean number of successes in n trials is 
u=np (3.57) 
This is clear, as the mean number of successes in n trials is equal to the probability of success in one 
trial times the number of trials. 


The standard deviation for a binomial frequency distribution is 


O=,/npq or .np(1—p) (3.58) 
q is not independent but is equal to (1 — p). The binomial distribution can be expressed in terms of two 


parameters, n and p. 


Equations (3.57) and (3.58) will now be derived using the definition of expectation for the mean and 
variance. We have 


E(r)= w= Dirk, = ree i 
r=0 1 
n (n —1)! rei ree . 
e (NN eee | = di n 
ies, eal) Ge aaa) p (-p) nplp+q] 
; _ (3.59) 


For the variance 07 


E(r-y =o 2 ae uw)’ P= me P.- ond rP +4 25) p 


r=0 n=O r=0 


= rr -2nu)+y 


r=0 


n 


Since S: P=1 
r=0 
and by rP.=W=np 
r=0 
Thus D)r°P, — (np) = Dire) +r 1h, ~ (mp 
r= =0 
= > rr- Dae p (l-p)"" + > rP, - (np) 


! 
= r=0 
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= Raa (n~2)! 
GS 3 ame 


=n(n — lp*(p + gy"? + np ~ (npy 
O =n(n— 1)p* + np — (np) = np(1 - p) =npq (3.60) 
Hence O =./npq 


p’(1- p)"” +np—(npy’ 


or 


(3.61) 
Summarizing, if X is a binomial random variable with parameters p and n, then the mean, 


l= E(X) =np (3.61a) 
and the variance, 


o = V(X) = np(1 —p) (3.61b) 
A binomial distribution with n = 20 and p = 0.10 is shown in Fig. 3.4(a). 


Cumulative binomial distribution are given in Appendix-D. 


P(x) 
0.3 
0.2 
0.1 


0.0 


Fig. 3.4(a): Binomial pmf for n = 20, p = 0.10 


Example E3.31 


Let X denotes the number of mechanical components that are defective in a testing process and assume 
that X is a binomial random variable with p = 0.001. If 1000 of these components are tested, find the following: 


(Qt (X =1) 

(2 P(X 21) 

top rx SZ) 

(d) mean and variance of X 
SOLUTION: 
We have from Eq. (3.50) 


n 
iz x(q — pyr 
f (x) ("| p’ (1-p) 


(a) P(X=1)= ea (0.001)'(0.999)"”” = 0.368 
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1000 
(b) P(X2 p=1-Per=0)=1-[ ; 


| (0.001)'(0.999)” = 0.632 


1000 
(ec) POS2)= S| (0.001)°(0.999)'°™ +| , ].o.001' 0.999)" 


+ “4 (0.001) (0.999) = 0.920 
(d) E(X)=1000(0.001) = 1 
V(X) = 1000(0.001) (0.999) = 0.999 


Example E3.32 


Picea ih 
A professional basket player makes 80% of the free throws he tries. Assuming this percentage will hold 
true for future attempts. find the probability that in the next eight tries the number of free throws he will 
make is 

(a) exactly 8 

(b) exactly 5. 


SOLUTION: 
Here n = 8, p = 0.80, q= 1—p = 1-—0.80 = 0.20 


nA 


8 
(a) Piexeetly8)= P(8) = | | p(i—p)"*= ") (0.80)°(0.20)° = 1(0.167772)(1) = 0.1678 


xX 


8 
5 


(b) P(exactly 5) = P(5) = ") pl —py*= ) (0.80)5(0.20)° = (56)(0.32768)(0.008) = 0.1468 


Example E3.33 


A soft drink company conducted a taste survey before marketing a new soft drink. The results of the survey 
showed that 80% of the people like this soft drink. On a certain day, 8 customers bought it. 


(a) let x denotes the number of customers in this sample of 8 who will like this soft drink. Using the 
binomial probabilities table, find the probability distribution of x and draw a graph of the probability 
distribution. Find the mean and standard deviation. 


(b) Using the binomial distribution of part (a), find the probability that exactly three of the eight 
customers will like the soft drink. 


SOLUTION: 


(a) Here n = 8, p = 0.80. From Appendix-C, for n = 8, and p = 0.88 we have x, P(x) tabulated as in 
Table’ E3:33. 
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Table E3.33 


0 
| 
2 
3 
4 
5 
6 
7 
8 


0.0459 
0 0.00010.00110.0092-—— 


Fig. E3.33 
The mean and standard deviation of x are (from Eqs. (3.61a) and (3.61b) 
Mean 
L = np = 8(0.80) = 6.4 


Standard deviation 


o = Jnpq = J8(0.80)(0.20) = 1.1317 


(b) P(exactly 3 customers like the soft drink) = P(3) = 0.0092 


le E3.34 


A certain mechanical system contains 10 components. Assuming that the probability of each individual 
component will fail is 0.2 and the components fail independently of each other. Given that at least one of 
the components has failed, what is the probability that at least 2 of the components have failed? 


SOLUTION: 

The number X of components that fail will have a binomial distribution with parameters n = 10 and 
p = 0.2. Hence, 
Res 2) _ I=Px =O) SPX = 1) 


Se 5, SE 0) 
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From Appendix-C, for n = 10, p = 0.2, we have 


1—0.1074—0.2684 0.6242 


= = 0.6993 
1—0.1074 0.8926 


P(X 22|X 21)= 


Example E3.35 


(a) Ifa fair coin is (probability of heads equals 1/2) is tossed independently 10 times. Use the table in 
Appendix-C of the binomial distribution to find the probability that strictly more heads are obtained 
than tails. 

(b) Suppose that the probability that a certain experiment will be successful is 0.3, and let X denotes 
the number of successes that are obtained in 15 independent performances of the experiment. Use 
the table in Appendix-C for the binomial distribution to determine the value of P(6 < X < 9). 


SOLUTION: 


(a) Let X be the number of heads obtained. More heads than tails are obtained if Xe {6, 7, 8, 9, 10}. 
The probability of this event is the sum of the numbers in the binomial table in Appendix-C. 
Corresponding to 


P= 5 
and r= 10) 
for x=6, 7, 8,9, 10 =0.2051 + 0.1172 + 0.0439 + 0.0098 + 0.0010 = 0.37695. 


By the symmetry of this binomial distribution, we can also compute the sum as 
(1 — P(X = 5))/2 = (1 — 0.2461)/2 = 0.37695. 
(b) From the table in Appendix-C for the binomial distribution with parameters n = 15 and p = 0.3, that 
P6s XS9)=PXK=6) + PX =7) + P(X =8) + PX=9) 
= 0.1472 + 0.0811 + 0.0348 + 0.0116 = 0.2747 


Example E3.36 


According to a particular survey, the probability is 0.40 that a traffic fatality involves an intoxicated or 
alcohol-impaired driver or non-occupant. In 8 traffic fatalities, find the probability that the number, A, which 
involve an intoxicated or alcohol-impaired driver or non-occupant is 


(a) exactly 3; at least 3; at most 3 

(b) between 2 and 4, inclusive 

(c) find and interpret the mean of the random variable A 
(d) obtain the standard deviation of A. 


SOLUTION: 
Here, n = 8 and p = 0.40. Thus g = 1 -p = 1 - 0.40 = 0.60 
From Eq. (3.50), we have 


f@= "| p*(1- py" 
DG 


Refer to the table in Appendix-C for n = 8 and p = 0.40 
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8 
(a) P(exactly 3) = | (0.4)°(0.6)° = 0.0241 


8 . 
P(3 or more) = 1 — PO or 1 or 2) = 1 - lal (0.4)°(0.6)* - ") (0.4)'(0.6)" — | (0.4)°(0.6)° 


= 1 — 0.0168 — 0.0896 — 0.2090 = 0.6846 
P(at most 3) = P(O) + P(1) + P(2) + PB) 


8 8 8 8 
= 5 (0.4)°(0.6)8 — | | (0.4)'(0.6)’ — i (0.4)°(0.6)°+ ; (0.4)*(0.6)° 
= 0.0168 + 0.0896 + 0.2090 + 0.2787 = 0.5941 


8 
(DIS P2 SA 54) =P) + PGCY+P@)= | (0.4)°(0.6)° — ") (0.4)*(0.6)° — | “| (0.4)"(0.6)' 


= 0.2092 + 0.2787 + 0.2322 = 0.7201 = 0.72 


(c) The mean of A is t= np = 8(0.4) = 3.2. On average, of 8 traffic fatalities, 3.2 will involve an intoxicated 
or alcohol-impaired driver or non-occupant. 


(d) Standard deviation = fo? = np — p) = J8(0.4)(0.6) = V19.2 = 4.3418 
Example E3.37 


According to a particular survey, 14.9% of those who have received a doctor’s degree in engineering are 
blacks. Suppose that 6 people who have received their doctor’s degree in engineering are randomly selected. 
Find the probability that 


(a) exactly 2 are black 

(b) exactly 4 are black 

(c) at least 2 are black 

(d) find the probability distribution of the number of blacks in a sample of 6 persons who have received 
their doctor’s degree in engineering 

(e) why is the probability distribution obtained in part (d) only approximately correct? What is the 
exact distribution called? 


SOLUTION: 
Here, n = 6 and p = 0.149 and g = 1-p=1-0.149=0.851 
From Eq. (3.50), we have 


f@= " pipe 
D§ 


(a) P(2)= | (0.149)?(0.85 1)* = 0.1747 
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6 
(b) PA= ’ (0.149)*(0.851Y = 0.0054 


6 6 
(c) P(at least 2)=1-—P(Qor 1)=1- 4 (0.149)°(0.851)° — ") (0.149)'(0.851)° 
= 1 —0.3798 — 0.3990 = 0.2212 


6 
(d) pax=s=( ) (0.149}(0.851)*  forx=0, 1, 2, ..., 6. 


x 


Applying this formula for each value of x gives the results in the Table E3.37. 
Table E3.37 


(e) The sampling was actually done without replacement, so the trials are not independent and the 
success probability changes very slightly from trial to trial. The exact probability distribution is 
called a hypogeometric distribution. 


Example E3.38 


Suppose the probability is 0.67 that the favorite in a horse race will finish in the money (first, second, or 
third place). In the next 5 races, what is the probability that the favorite finishes in the money 


(a) exactly 2 times 
(b) exactly 4 times 
(c) at least 4 times 


(d) between 2 and 4 times, inclusive 


(e) find the probability distribution of the random variable, X, the number of times the favorite finishes 
in the money in the next 5 races 


(f) identify the probability distribution of X as right skewed, symmetric, or left skewed without checking 
its probability distribution or its histogram 


(g) Draw a probability histogram for X 

(h) find the mean and standard deviation of the random variable X. 
SOLUTION: 
Refer to the table in Appendix-C. 
Here n = 5, p= 0.67 and g=1-—p=1-067=0.33 
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From Eq. (3.50) we have 


n-X 


Ji)=|_ |p d=p) 


5) 


PQ) = (0.67)°(0.33) = 0.004 


0 


i ©:67)'(033y = 


2 


(0.67)(0.33) = 0.328 


5 
4 (0. 67)'(0.33)! = 0.332 


P6)= 5 


() 
0) 
“Li 
PQ)= a (0.679(0.33) =0.161 
“L 
“| 
| 


(0.67)°(0.33)° = 0.135 


(a) P(2)=0.161 

(b) P(4)=0.332 

(c) P(X 24)=P4) + PS) = 0.332 + 0.135 = 0.467 

(dq) PQ2SXS4=P(2)+ PB) + P4) =0.161 + 0.328 + 0.332 = 0.821 


(e) 


(f) Left-skewed 
(g) See Fig. E3.38. 


0.328 0.332 


P(X=x) 


Fig. E3.38 
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[x [Po = x) | xP 9) fa 


(h) Mean = pt = 3.349 (from the above table) 
Variance, 
Oo? = 12.323 — 3.349 = 1.107 


Hence, standard deviation = ./].107 = 1.052 


Alternate calculation: 
L=np= 30:67) = 355 
O° = np(i — p) = 5(0.67)(0.33) = 1.1055 


or o = ¥1.1055 = 1.051 


3.3.5 Poisson Distribution 


The Poisson distribution, named after the French mathematician Simeon D. Poisson is another important 
probability distribution of a discrete random variable that has many applications. The Poisson distribution 
is applied to experiments with random and independent occurrences. 

Given an interval of real numbers, assume events occur at random throughout the interval. If the interval 
can be partitioned into subintervals of small enough length such that 

1. the probability of more than one event in a subinterval is zero 

2. the probability of one event in a subinterval is the same for all subintervals and proportional to 

the length of the subinterval, and 


3. the event in each subinterval is independent of other subintervals, the random experiment is called 
a Poisson process. 

Independence of occurrences means that one occurrence (or non-occurrence) of an event does not 
influence the successive occurrence or nonoccurrences of that event. The occurrences are always considered 
with respect to an interval. The interval may be a time interval, a space interval, or a volume interval. The 
actual number of occurrences within an interval is random and independent. If the average number of 
occurrences for a given interval is known, then by using the Poisson probability distribution we can 
determine the probability of a certain number of occurrences, x, in that interval. Note that the number of 
actual occurrences in an interval is denoted by x. 


Conditions to apply Poisson probability distribution 

The following three conditions must be satisfied to apply the Poisson probability distribution: 
1. xis a discrete random variable 
2. The occurrences are random 
3. The occurrences are independent 
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The following are a few examples of discrete random variables for which the occurrences are random 
and independent. 


Some of the phenomena that follow the Poisson distribution are: 

counts of flaws in castings 

the number of vehicles on a highway 

the number of customers visiting a bank 

the number of accidents that occur on a given highway during a period of time 
the number of telephone calls 


counts of power outages 


NDF WN 


counts of atomic particles emitted from a specimen. 


The random variable X that equals the number of events in the interval is a Poisson random variable 
with parameter 0 < A, and the probability mass function (pmf) of X is 


—h5 x 
z a re01 0 (3.62) 
xa 


f= 
The Poisson distribution is made up of a series of terms: 


-i4 2 -~h4 3 
ee, © x ae N 


2! 3! 


seen 


representing respectively, the probabilities of occurrence of 0, 1, 2, 3, 4, etc. events, where e is the base of 
the natural logarithm and A is the mean frequency of occurrence. The sum of the probabilities is one (1) 
because 


co “Ay k nce ai 
Ie ao (3.62a) 


k k=0 


and the summation on the right-hand side of the equation (3.62a) is recognized to be Taylor’s expansion of 
e* evaluated at A. Hence, the summation equals e* and the right-hand side equals e *e* = 1. Poisson 


distribution for A = 3 is shown in Fig. 3.4(b). 


0.220 
0.200 
0.180 
0.160 
0.140 
0.120 
0.100 
0.080 
0.060 
0.040 
0.020 
0.000 


p(x) 
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3.3.5.1 Derivation from Binomial Distribution 


The Poisson distribution can also be deduced from the binomial distribution, provided that n is large (>), 
p is very small (-90), and np is finite. It was shown earlier that in n trials the probability of an event 


succeeding r times is hl 
P=) 
"  r\(n-r)! 


When n is large compared with r, 


ir)! 


Cee fe? 


q 


Therefore, the probability of r successes becomes 


ae =— p’ u-F 
r! 
Now if p is very small and r is not large, 
g=(1—pyr=1 
and q = qe p). 
Hence 2 = Dig =p) = ea 
ri r! 


r 2 3 
_ (mp) in py _ py, 
r! 2} 3! 


Thus P.= Le 
r! 


| 


Zh 


This, then, is the probability of 7 successes in n trials. 


3.3.5.2 Mean and Standard Deviation 


=n(n-1)(n—-2)..(n—rt+) =n" 


_na=DEpy , mn Nin= 2p’ 


5! 


The mean number of occurrences of an event per unit of time (or space) is 


[=np 


and the standard deviation of the numbers of events is 


c= 


Thus the mean and variance are equal to one another: 


=O =np 


Equations (3.66) and (3.68) will now be derived. 


mean = E(r)= }) rP. = >> _— =O+Ae™ +2 
r=0 r=0 r. 
as 
sae aaa 
2s! 


72e7* 367% 
sae de 
2} 


3! 


|-rere =A 


flere 


| 


(3.63) 


(3.64) 


(3.65) 


(3.66) 


(3.67) 


(3.68) 


(3.69) 


/! Random Variables and Probability Distributions // 121 


The variance 0° is given by 


O° = E(r—A)? = ¥(r-AyP. 


r=0 
o =P -2AV re +> P 
r=0 r=0 r=0 
NG —2A )) rP, = -20(A) = -2A? 
r=0 
SS a= eee 
r=0 
oo oo oo 42 -- 43 ~A 
and vrP = Sir -)+rJP, = ¥ r(r-)P. ai -|os0424 are x tela 
r=0 r=0 r=0 : 


=Ae% fisned on lanamereryenan 7 


Thus, collecting the terms, we have 
O=N+A-2VN 4+ 


or OA (3.70) 
Hence, if X is a Poisson random variable with parameter A, then 
the mean w= E(X) =A, and 
the variance ABAD. @ es7 6 


Note that the Poisson distribution contains only one parameter, np, the mean occurrence of an event, 
and we do not know the value of n. In the binomial distribution we know the number of times an event 
occurs and the number of times an event does not occur. 

Cumulative Poisson probabilities are given in Appendix-D. A plot of the Poisson distribution for A = 3 
is shown in Fig. 3.5. This Poisson distribution has a very long tail to the right (the distribution is skewed). 
0.24 0.224 0.224 
0.22 

0.2 
0.18 
0.16 
0.14 
0.12 

0.1 
0.08 
0.06 
0.04 
0.02 


p(x) 


0.0081 9 9027 0.0008 


x 
Fig. 3.5: Poisson distribution for 4 = 3 
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Example E3.39 


ct hl hc EE 


A survey found that 1.5% of occupied housing units have 7 or more people living within. Use the Poisson 
distribution to determine the approximate probability that, of 200 randomly selected occupied housing units, 
there are 


(a) none with 7 or more persons 


(b) 3 or more with 7 or more persons 
SOLUTION: 
We use A = np = 200(0.15) = 3.0. We also note that n 2 100 and np < 10. Hence, apply Eq. (3.62), 


ern? 


56 


f(x) = 


3.0 0 0 
(ape = OV co = 0.0498 


(b) P(at least 3) = 1—P(X =O or 1 or 2) 


] a 
=] |e al Per 7 see = = 1 — [0.0498 + 0.1494 + 0.2240] = 0.5768 
Example E3.40 


Assume the number of errors along a magnetic recording surface is a Poisson random variable with a mean 
of one error every 10° bits. A sector of data consists of 5000 eight-bit bytes. Find 


(a) the probability of more than one error in a sector 


(b) the mean number of sectors until an error is found. 
SOLUTION: 


Let x denotes the number of errors in a sector. Then, X is a Poisson random variable with A = 0.4. 
(a) PX>D=1-P(X%<1)=1-e%-e€%04) = 1-—0.6703 — 0.2681 = 0.06157 


(b) Let Y denotes the number of sectors until an error is found. Then, Y is a geometric random variable 
and 


P=P(X2 1)=1-P(X=0)=1-e™ = 1 — 0.6703 = 0.3297 


a = 3.03306 


E(Y)= L = 
p 0.3297 


Example E3.41 


a 
The probability that an individual recovers from an illness in a 2-week time period without medical treatment 
is 0.1. If 20 independent individuals suffering from this illness are treated with a medicine and 4 recover in a 


2-week period. If the medicine has no effect, what is the probability that 4 or more people recover in a 
2-week time period? 
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SOLUTION: 
Let x denotes the number of individuals that recover in 2-week time period. Assume the individuals are 
independent, then, X is binomial random variable with n = 20 and p=0.1. 


P(X 2 4) = 1~P(X $3) =1-[P(X=0) + P(x= 1) + Px =2)+ P(X =3)] 
20 eee 20) 2 20 eae Pan 40) 
= -| 5 on 0.9)" +| Jo.n'osy"+| ; Jo.nrosy" +| : Joos 
= 1—[0.1216 + 0.2702 + 0.2852 + 0.1901] = 0.1330 


Example E3.42 


A survey found that the traffic flowing through an intersection with an average of 3 cars per 30 seconds. 
Assume the traffic flow can be modeled as a Poisson distribution. 


(a) find the probability of no cars through the intersection within 30 seconds 
(b) find the probability of 3 or more cars through the intersection within 30 seconds 


(c) find the minimum number of cars through the intersection so that the probability of this number or 
fewer cars in 30 seconds is at least 90% 


(d) if the variance of the number of cars through the intersection per minute 1s 20, is the Poisson 
distribution appropriate? 


SOLUTION: 
(a) A =3cars/30 seconds 


—Ay x 
x 
we have f(x)= = 
x! 
340 
P(X=0) = 2 = 0.0498 


0! 
(b) P(X>3)=1-P(X<3)=1-[P(X=0)+ P(X = 1) + PX=2) 
|? ens. ere 


Sot tay | = 1 — 0.0498 + 0.1494 + 0.2240] = 0.5768 


(c) PXsx)2090rx=5 
(d) ©? =i=6. This is not appropriate. 


Example E3.43 
A student’s campus newspaper in a particular university contains an average of 1.2 typographical errors 
per page. | | 
(a) find the probability that a randomly selected page of this newspaper will contain exactly 
4 typographical errors using the Poisson formula | 
(b) find the probability that the number of typographical errors on a randomly selected page will be 
(i) more than 3 
(ii) less than 4 
Use the Poisson probabilities table in Appendix-D. 
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SOLUTION: 


Let x be the number of typographical errors on a randomly selected page of this newspaper. Since it contains 
an average of 1.2 typographical errors per page, A = 1.2. We have Eq. (3.62) 


e x 


f(x)= 


x! 
ve _ (2) en 
x! ! 
(b) (i) P(more than 3) = P(x > 3) = P(4) + P(5) + P(6) + P(7) = 0.0260 + 0.0062 + 0.0012 + 0.0002 = 0.0336 
(ii) P(less than 4) = P(x < 4) = P(O) + P(1) + P(2) + P(3) = 0.3012 + 0.3614 + 0.2169+ 0.0867= 0.9662 


Example E3.44 


In a statistics course final examination, 15% of the students fail. 
(a) find the probability that in a random sample of 100 students in that statistics class who took the 
final examination exactly 20 will fail. Use the Poisson formula. 


(b) find the probability that the number of students who fail this statistics final examination in a 
randomly selected 100 students is 


= 0.0260 


(a) Ptexactly 4) = P(x =4)= 


(i) at most 9 
(ii) 10to 16 
(iii) at least 20 
Use the Poisson probabilities table in Appendix-D. 


SOLUTION: 


Let x be the number of students in a random sample of 100 who fail the final examination in statistics. Since, 
an average, 15% of the students fail the examination, A = 0.15(100) = 15. 


en : (eds) 
“x! (20! 
(b) (i) P(at most 9 fail) = P(e < 9) = PO) + P(1) + P(2) + PQ) +++ + P() 
= 0,0000 + 0.0000 + 0.0000 + 0.0002 + 0.0006 + 0.0019 + 0.0048 + 0.0104 +0,0194 40.0324 
= 0,0697 
(ii) P10 to 16 fail) = PU0< x < 16) = P(10) + P(LL) + P(12) + P(13) + P(14) + P(15) + P(16) 
= 0,0486 + 0.0663 + 0.0829 + 0.0956 +. 0.1024 40.1024 + 0.0960 = 0.5942 
(iii) P(at least 20) = P(x > 20) = P(20) + P(21) + P(22) + + + P39) 
= 0.0418 + 0.0299 + 0.0204 + 0.0133 + 0.0083 + 0.0050 + 0.0029 + 0.0016 + 0.0009 +.0.0004 
+ 0.0002 + 0.0001 + 0.0001 + 0.0000 + «++ + 0,0000 = 0.1249 


Example E3.45 


A hardware store in a big city receives an average of 9.8 telephone calls per hour. 


(a) find the probability that exactly 6 telephone calls will be received at this store during a certain 
hour. Use the Poisson formula. 


(a) P(exactly 20 fail) = P(x = 20) = = 0.0418 
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(b) ate ae probability that the number of telephone calls received at this store during a certain hour 
Ww 
(i) less than 8 
(ii) more than 12 
(iii) Sto8 
Use the Poisson distribution table in Appendix-D. 


SOLUTION: 


Let x be the number of telephone calls received at this hardware store during a certain hour. Since the 
average number of telephone calls per hour is 9.8, A = 9.8. 


-Ay x -9.8 6 
a = (e7)0.8)" = 0.0677 
Wis 


6! 
(b) (i)  P(less than 8) = P(x < 8) = P(O) + P(1) + P(2) + P(3) + P(4) + P(5) + P(6) + P(7) 
= 0.0001 + 0.0005 + 0.0027 + 0.0087 + 0.0213 + 0.0418 + 0.0682 + 0.0955 = 0.2388 
(ii) P(more than 12) = P(x > 12) = P(13) + P(14) + P(15) + -:- 
= 0.0685 + 0.0479 + 0.0313 + 0.0192 + 0.01111 + 0.0060 + 0.0031 + 0.0015 + 0.0007 + 0.0003 
+ 0.0001 + 0.0001 = 0.1898 
(iii) P(S to 8) = P(S $x <8) = P(5) + P(6) + P(7) + P(8) = 0.0418 + 0.0682 + 0.0955 + 0.1170 = 0.3225 


Example E3.46 


An average of 0.7 accidents occur per day in a large city. 
(a) find the probability that no accidents will occur in that city on a given day 
(b) write the probability distribution of x, where x denotes the number of accidents that will occur in 
that city on a given day 
(c) find the mean, variance and standard deviation of the probability distribution developed in 
part (b). 


(a) P(exactly 6) =F(x=6)— 


SOLUTION: 
x = number of accidents on a given day in that city. 
Since the average number of accidents per day is 0.7, A = 0.7. 


ee oan U1 
x! 


= 0.4966 


(a) P(no accidents) = P(x =0)= 
(b) From the table in Appendix-D, we have 


126 // Probability and Statistics for Scientists and Engineers /I 


(c) The mean, variance and standard deviation are: 


mean p=A=17 
variance 6* = A = 0.7 and 


standard deviation, o= ./), = J0.7 = 0.8366 


Example E3.47 


The number of male mates of a queen bee was found to have a Poisson distribution with parameter A = 2.7. 
Find the probability that the number, N, of male mates of a queen bee ts 


(a) 
(b) 
(c) 
(d) 
(e) 


(f) 


exactly 2 

at most 2 

between | and 3, inclusive 

on average, how many male mates does a queen bee have? 


develop a table of probabilities for the random variable, N. Compute the probabilities until they are 
zero to 4 decimal places 


draw a histogram of the probabilities in part (c). 


SOLUTION: 


(a) 


(b) 


(fe) 
(d) 


He) 


5a es 
P\N=2)=e7! = ().2450 
OF 


“as 


PP ee re 
27 oe! = _4¢ 7! —__ = 0672 4016 15 02450 — 0 oan 


<= — = 
PVE 270 or 1 or 2)= € 0! 7 ir 2 


Poy) ieee. ere 
PU. SNS3)=P(1 or2 or 3)= e Fle. e ie e ris 0.1815 + 0.2450 + 0.2205 = 0.6470 
On average, the number of mates of a queen bee is 
= 2) 


Refer to Appendix-D. The results are shown in Table E3.47. 


Table E3.47 


OmearAINNMNBHWNK CO 
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(f) See Fig. E3.47 
0.3 


Fig. E3.47 


3.4. CONTINUOUS PROBABILITY DISTRIBUTIONS 


3.4.1. The Normal Distribution 


In everyday life, people deal with and use a wide variety of variables. Some of these variables — such as 
heights of people, scores in final examinations, aptitude test scores, TOEFL (Test of English as a Foreign 
Language), and GRE (Graduate Record Examination) share an important characteristic: their distributions 
have roughly the shape of a normal curve, that is, a special type of bell-shaped curve like one shown in 
Fige3.6. 


f(x) 


mean uU 
Fig. 3.6 Normal curve 


The normal distribution is in many respects the cornerstone of statistics. A random variable X is said 
to have a normal distribution with mean [1(—c° <  < o0) and variance o° > 0 if it has the density function 


Ff) = ee WI Wor ee G71) 
ov 20 
The distribution is illustrated graphically in Fig. 3.6. The normal distribution is used so extensively that 
the shorthand notation X ~ N({1, 6”) is often used to indicate that the random variable X is normally distributed 
with mean wand variance 0”. 
A variable is said to be a normally distributed variable or to have a normal distribution if its distribution 
has the shape of a normal curve. 
If a variable of a population is normally distributed and is the only variable under consideration, the 
general practice is to assume that the population is normally distributed or that it is a normally distributed 
population. In reality, a distribution is unlikely to have exactly the shape of a normal distribution curve. If a 
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variable’s distribution is shaped roughly like a normal curve, we can consider that the variable is an 
approximately normally distributed variable or that it has approximately a normal distribution. 

A normal distribution (and hence a normal curve) is completely determined by the mean and standard 
deviation. Hence, two normally distributed variables having the same mean and standard deviation must 
have the same distribution. We identify a normal curve by stating the corresponding mean and standard 
deviation and calling those to parameters of the normal curve. 

A normal distribution is symmetric about and centred at the mean of the variable, and its spread depends 
on the standard deviation of the variable. The larger the standard deviation, the flatter and more spread out 
is the distribution. 

In summary, the normal curve associated with a normal distribution is bell shaped, centered at u, and 
close to the horizontal axis outside the range from [1 — 36 to ut + 30 as shown in Figs. 3.6 and 3.6(a). 


Normal curve (i, 0) 


u-360 u-20 L-c um uto ut2o0 8 pt3o z 
Fig. 3.6(a) 


Random variables with different means and variances can be modeled by normal probability functions 
with appropriate choices of the centre and width of the curve. The value of E(X) = u determines the centre 
of the probability density function and the value of V(X) = 0? determines the width. Figure 3.6(c) illustrates 
several normal probability density functions with selected values of m and 6”. Each one has the characteristic 
symmetric bell-shaped curve, but the centers and dispersions differ. 


Definition of Normal Distribution 
A random variable X with probability density function 
x-pL 


Foye) 00 < K <0 
210 


is a normal random variable with parameters , where -co < x < ce, and o > 0. 
Also E(X)= and V(X) = 0° 


and the notation M(, 6°) is used to denote the distribution. The mean and variance of X are equal to and 
0’, respectively. 


f(x) 


Ss f= 16) 


Fig. 3.6(b): Normal probability density functions for 
selected values of the parameters p and o? 
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3.4.1.1 Properties of the Normal Distribution 


The normal distribution has several important properties: 


L [" fQar=1 


2. f(x) 20 for all x 

lim... f@) = 0 and lim, f(x) = 0 

F[(& + W)] =f [Hx — )}. The density is symmetric about 1. 
The maximum value of f(x) occurs at x = UL. 


nn WwW 


The points of inflection of f(x) are at x= 1+ 0. 


b 
Also, Pa<x<b)= | fax = area under f(x) from a to b for any a and b. 


a 


Property 1 may be demonstrated as follows. Let y = (x — p)/o in Eq. (3.71) and denote the integral as /. 
That is, 


1 SS SA 1 °° _(1/2)x? 
a SSS ’ dy-—— Tax 3u/2 
a anil ese 


on changing to polar coordinates with the transformation of variables v = r sin 8 and z = r cos 8, the integral 
becomes 


nara , A 
oe ie 20 e— pes eee (3.73) 
LOT 0 


3.4.1.2 Mean and Variance of the Normal Distribution 


The mean of the normal distribution may be determined easily. Since 


E(X)= f ee MWg (3.74) 


*oV2n 


and if we let z = (x — 1)/o, we obtain 


E(X)= Iz Fut ze? dz (3.75) 


dz pee ‘12 dy (3.76) 


elt Vn 


Since the integrand of the first integral is that of a normal density with p = 0 and o° = 1, the value of the 
first integral is one. The second integral has value zero, that is, 


oo 


-77/2 0 


—°co 


fog 
N 


2 1 
=i JD 
daz =—= ze 
al J2n 


and thus E(X)=p{1] = of0] = (3.77) 
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To find the variance we must evaluate 


1 


ov 27 


aT 2 
a p)/o] dx 


V(X) = EX -py l= | @-py 


And letting z = (x — y)/o, we obtain 


° 1 a ) " Zz =o 
Vixy= | a7 ae acne | Tm ae 
ae 2n 


27/2 


3 | of i * f 1 
J2n | 2.v2n 
so that Vivace (3.78) 


In summary the mean and variance of the normal density given in Eq. (3.78) are us and 0”, respectively. 


eo? az = 67041) 


3.4.1.3 The Cumulative Normal Distribution 
The distribution function F is 


co 


1 a/2y(x-p/o? 
FQ)= P(X Se a | 3.79 
Neoge Cfo 
It is impossible to evaluate this integral without resorting to numerical methods, and even then the 
evaluation would have to be accomplished for each pair (u, 6”). However, a simple transformation of variables, 
z = (X — ))/o, allows the evaluation to be independent of lt and o. 


That is, 


eaall (x-p/o 1 _i2y9 (x-)/0 (=#) 
F(X) = P(X $ x)= P| ZS = — = zee 
Oe 00, : ie Rae de fo o@az = 6 - (3.80) 


3.4.1.4 The Standard Normal Distribution 
The probability distribution in Eq. (3.80) above, 


1 etl 


10) 00 < x <0 
1 


is a normal distribution with mean 0 and variance 1; that is, Z ~ N(O, 1) and we say that Z has a standard 
normal distribution. A graph of the probability density function is shown in Fig. 3.7. 


0 
Fig. 3.7: The standard normal distribution 
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The corresponding distribution function is o, where 


oa)= | 4 


rae ee 
aoe (3.81) 


where z=- 


Q 


and this function has been well tabulated. 
A few useful results for a normal distribution are summarised below and shown in Fig. 3.7(a). For random 
variable, 
P(Uu-0 <X<w+6) =0.6827 
P(u— 26 <X <p +20) = 0.9545 
P(u-30 <X <p +30) = 0.9973 
In other words, any normally distributed variable has the following properties: 
1. 68.27% of all possible observations lie within one standard deviation to either side of the mean. 
that is, between u — 6 and w+. 
2. 95.45% of all possible observations lie within two standard deviation to either side of the mean, 
that is, between uw — 20 and pt + 20. 
3. 99.73% of all possible observations lie within three standard deviation to either side of the mean, 
that is, between p — 30 and pu + 30. 
These properties are shown in Fig. 3.7(b). 


L_ 45% 


= as +20 
—2 5 Zz 
(b) 


Fig. 3.7(a) 


In addition, from the symmetry of f(x), P(X >) = P(X < pW) = 0.5. The above model assigns some 
probability to each interval of the real line since f(x) Is positive for all x. The probability density function 
decreases as x moves further from p. As a result, the probability that a measurement falls far from Yt is small. 
and at some distance from u the probability of an interval can be approximated as zero. 

For a normally distributed variable, the percentage of all possible observations that (i.e., within any 
specified range) equals the corresponding area under its associated normal curve, expressed as a percentage. 
This result holds approximately for a variable that is approximately normally distributed. 

A normally distributed variable having mean 0 and standard deviation | is said to have the standard 
normal distribution. Its associated normal curve is called the standard normal curve. which is shown in 


Fig. 3.7(c). 
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The area under a normal probability density function beyond 30 from the mean is quite small. Since 
more than 0.9973 of the probability of a normal distribution is within the interval (u — 30, 4 + 30), 66 is often 
referred to as the width of a normal distribution. Advanced integration methods can be used to show that 
the area under the normal probability density function from —ce < x < eis 1. 


3.4.1.5 Problem-Solving Procedure 
If X is a normal random variable with E(X) = uw and V(X) = 0°, the random variable 


is anormal random variable with E(Z) = 0 and V(Z) = 1. That is, Z is a standard normal random variable. 


Suppose X is a normal random variable with mean w and variable o’. Then 


Poe pf <# <2) =P 
oO (@) 


os 
where Z is a standard normal variable, and z = ([==#] is the z-value obtained by standardising X. 


The probability is obtained by using the table in Appendix-E with z = (x — y/o. 


Basic Properties of the Standard Normal Curve 
1. The total area under the standard normal curve is 1. 


2. The standard normal curve extends indefinitely in both directions, approaching, but never touching, 
the horizontal axis as it does so. 


3. The standard normal curve is symmetric about 0. That is, the part of the curve to the left of the 
vertical line at the centre in Fig. 3.7(5) is the mirror image of the curve to the right of it. 


4. Almost all the area under the standard normal curve lies between —3 and 3. 


f(x) 


Fig. 3.7(b): Probabilities associated with a normal distribution 
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-3 -2 -1 0 1 2 3 
Fig. 3.7(c): Standard normal distribution 


A normal random variable with 1 = 0 and 6? = 1 is called a standard normal variable and is denoted as Z. 
The cumulative distribution function of a standard normal random variable is denoted as ee) = P( Lise), 
The tables in Appendix-E provides cumulative probabilities for a standard normal random variable. 


The procedure in solving problems involving calculating the cumulative normal probabilities is simple. 
For example, suppose that X ~ N(50, 4), and we wish to find the probability that x is less than or equal to 
54; that is P(x < 54) = F(54). Since that standard normal random variable is 


o 
we can standardize the point of interest x = 104 to obtain 
et eee 
Zé ——— ———————— 
o 2 


Now the probability that the standard norma! random variable z is less than or equal to 2 is equal to 
the probability that the original normal random variable x is less than or equal to 54. Expressed 
mathematically, 


2 


BOs (4) = o(z) 


F (54) = 0(2) 
The table in Appendix-E contains cumulative standard normal probabilities for various values of <. 
From this table, we can read 
(2) = 0.9772 
Note that in the relationship z = (x — 1)/o, the variable z measures the departure of x from the mean wu in 


standard deviation (0) units. In our example F(54) =f (2) indicates that 54 is two standard deviations (6 = 2) 
above the mean. In general, x = u + © z. In solving problems, we sometimes need to use the symmetry 
property of @ in addition to the tables. 

In order to find the percentages of all possible observations of a normally distributed variable that lie 
within any specified range, we express the range in terms of z-scores and then determine the corresponding 
area under the standard normal curve. The stepwise procedure to determine a percentage or probability for 
a normally distributed variable is presented below: 

1. Sketch the normal curve associated with the variable as shown in Fig. 3.7(d). 


2. Shade the region of interest and mark its delimiting x-value(s). 
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3. Compute the z-scores for the delimiting x-value(s) found in step 2. 

4. Use the table in Appendix-E to find the area under the standard normal curve delimited by the 
z-score(s) found in step 3. 

The probability distributions presented in this chapter are summarised in Hable 320 


a 


\ CW Normal curve (1, 0) 
\ \ \ ~ 
SSX GS35c 
WA KH 


Fig. 3.7(d) 


Table 3.1: Summary of probability distributions 


Discrete 


Hypergeometric 
x =max(0,n—N+K), 1, ... 
min(k,n), K<N,n<N 
Binomial ; 
| Dp: (1 = a : 
i 
=O 1, 98 0S p= 1 
Poisson 
Continuous 
Normal 


OO KX <S0, co KU < oo, OG > 0 


Example E3.48 


(a) Determine the area under the standard normal curve that lies to the left of 1.24, as shown in 
Fig. E3.48(a). 
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(b) Determine the area under the standard normal curve that lies to the right of 0.77. as shown in 
Fig. E3.48(b). ' 


((@) Determine the area under the standard normal curve that lies between —0.69 and 1.83, as shown in 
Fig. E3.48(c). 


Z 


z=0.77 
(b) 
Fig. E3.48 


SOLUTION: 


(a) Refer to the table in Appendix-E. First, we go down the left hand column. labeled z, to "1.2". Then. 
going across that row to the column labeled “0.04”, we reach 0.892512. This number is the area 
under the standard normal curve that lies to the left of 1.24, as shown in Fig. E3.48(d). 


(b) Because the total area under the standard normal curve is |, the area to the right of 0.77 equals | 
minus the area of the left of 0.77. We find this by first giving down the c-column to "0.7". Then. 
going across that row to the column, labeled “0.07”, we reach 0.779350, which is the area under the 
standard normal curve that lies to the left of 0.77. Thus, the area under the standard normal curve 
that lies to the right of 0.77 is 1 — 0.779350 = 0.220650 as shown in Fig. E3.48(e). 


(c) The area under the standard normal curve that lies between —0.69 and 1.83 equals the area to the 
left of 1.83 minus the area to the left of -0.69. The table in Appendix-£ shows that these latter two 
areas are 0.966375 and 0.245097, respectively. Hence the area we seek is 0.966375 — 0.245097 = 
0.721278, as shown in Fig. E3.48(f). 

Area = 0.966375 — 0.245097 

= 0.721278 


Area = 0.89251 Area = 1— 0.779350 


= 0.220650 


Z Z z£ 
Z= 1.24 z=0.77 z=-0.69 2=1.83 
(d) (e) (f) 
Fig. E3.48 


Determine the z-score having an area of 0.06 to its left under the standard normal curve. as shown in 


Fig. E3.49. 


Area = 0.06 


z=? 


Fig. E3.49 
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SOLUTION: 


See Fig. E3.48(a). 


Search the body of the table in Appendix-E for the area 0.06. There is no such area, so use the area closest 
to 0.06. which is 0.060571. The z-score corresponding to that area is —1.55. Hence, the <-score having area 
0.06 to its left under the standard normal curve is roughly —1.55, as shown in Fig. E3.49(a). 


Area = 0.06 


z= -1.55 
Fig. E3.49(a) 

Example E3.50 
Use the table in Appendix-£ to find 

(a) 0.025 

(b) “0.05 
SOLUTION: 

(G@)  Syo 8 the z-score that has an area of 0.025 to its right under the standard normal curve, as shown 


in Fig. E3.50(a). Because the area to its right is 0.025, the area to its left is 1 — 0.025 = 0.975, as 
shown in Fig. E3.50(b). Table in Appendix-£ contains an entry for the area 0.975002, its 
corresponding z-score is 1.96. Thus, z,,,, = 1.96, as shown in Fig. E3.50(5). 

(b)  ,,< is the z-score that has an area of 0.05 to its right under the standard normal curve, as shown in 
Fig. E3.50(c). Because the area to its right is 0.05, the area to its left is 1 — 0.05 = 0.95, as shown in 
Fig. E3.50(d). Table in Appendix-E does not contain any entry for the area 0.95 and has two area 
entries equally closest to 0.95 — namely, 0.949497 and 0.950529. The z-scores corresponding to 
those two areas are 1.64 and 1.65 respectively. So our approximation of is the mean of 1.64 and 
1.65, that is = 1.645, as shown in Fig. E3.50(d). 


= 
£0.05 
7 “0.05 


Area = 0.975 


Area = 0.025 Area = 0.025 


Fig. E3.50 
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Example E3.51 


Find the two z-scores that divide the area under the standard normal curve into a middle 0.95 area and two 
outside 0.025 areas, as shown in Fig. E3.51(a). 


Fig. E3.51 


SOLUTION: 


The area of the shaded region on the right in Fig. E3.51(a) is 0.025. The corresponding z-score. Semon: 96. 
Because the standard normal curve is symmetric about 0, the z-score on the left is —1.96. Therefore the two 
required z-scores are +1.96, as shown in Fig. E3.51(d). 


Example E3.52 


In a long distance run of 10 km in a city the times of finishers are normally distributed with mean 71 minutes 
and standard deviation 9 minutes. 


(a) determine the percentage of finishers with times between 60 and 80 minutes 
(b) find the percentage of finishers with times less than 85 minutes 

(c) obtain and interpret the 40" percentile for the finishing times 

(d) find and interpret the 8" decile for the finishing times. 


SOLUTION: 


(a) For finishers with times of 60 and 80 minutes, the z-values are 


r- - is S0a 
oO 9 oO 9 
The area to the left of z = —1.225 is 0.111233 and the area to the left of s = 1.00 is 0.841345. Hence, 
the area between z = —1.22 and z = 1.00 is 0.841345 — 0.111233 = 0.730112. Thus, the percentage of 

finishers with times between 60 minutes and 80 minutes in that city 10 km run is 73.01%. 


(b) For finishing time of 85 minutes, the z-value is 


= 1.00 


~ 


The area of the left of z = 1.56 is 0.9406. Thus, the percentage of finishers with times less than 


85 minutes is 94.06%. 
(c) Using the table in Appendix-£, we find that an area of 0.40 lies to the left of < = -0.25. We convert 
this z-value to an x-value using x = ub + co. Hence 40% of the finishers had times less than 40" 


percentile, x = 71 + (—0.25)(9) = 68.75 minutes. 
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(d) The 8" decile is the same as the 80" percentile. Using the table in Appendix-E, we find that an area 
of 0.80 lies to the left of z = 0.84. We convert the z-value to an x-value using x = ¢ + zo. Thus, 80% 
of the finishing times were less than 71 + (0.84)9 = 78.56 minutes. 


Example E3.53 


ee 
If X has a normal distribution for which the mean is 1 and standard deviation is 2, find the value of each of 
the following: 

(2) “PX S3) 

(b) PUXS 15) 

(c) PX=1) 

(a) P@eX <5) 

(e) P(X20) 

(f) P(-1<X<05) 

(g) P(\X|<2) 

(h) POU S-2X+3<8) 


SOLUTION: 


Given ut = | and standard deviation = 2. Refer to the table in Appendix-E for cumulative standard normal 
standard distribution. 


eS | 


fe} P. 
then z has a standard normal distribution. 

(a) P(XS3)=P(Z¥< 1) = @(1) = 0.841345 

(b) P@S15)=PZ> 025) = 1—8(0:25) = 04013 

(c) P(X =1)=0, because X has a continuous distribution. 

(d) P(2<X<5)=P(05 <Z< 2) = (2) — B(0.5) = 0.2858 

(e) PX20=P22-05) =FizZ=05)= O05) =0G915 

(f) P(-1<X<05)=PC1<Z<-—-0.25) = P(0.25 < Z< 1) = B(1) — B0.25) = 0.2426 

(2) PUSH PC sXs2)=P-lSSZ2<05)=FiZ<05) PZ = er <0 a eee 
= (0.5) — [1 — ®(1.5)] = 0.6247 

(h) PUS-2X+3<8)=P(2<-2X<5)=P-25<Xs1)=P(-1.75 <Z<0)=PO<Z§¢ 1.75) 
= 0(1.75) — B(0) = 0.4599 


Example E3.54 
Assume X is normally distributed with a mean of 10 and a standard deviation of 2. Determine the following: 
(a) P(X<13) 
(b) P(X>9) 
(c) P6@<X<14) 
(d) P(2<X<4) 
(e) P(2<X<8) 


~ 
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SOLUTION: 


Refer to the table in Appendix-E. 


(a) P(XX<13)= pz< ot 


0 
= P(Z < 1.5) = 0.93319 


(b) P(X>9)=1-P(X<9)=1-P(Z<-05) =0.69146 


(c) PO<X<14)= pf P< ze ue 


=PG2<7<2)=(P(Z<2) -PiZ<-))| =0955 


Ke Whee 


(d) P2Q<X<4)= pj = 


=P(i4<Z<-3)=PZ<-3)-P(Z<-4) =0.00132 


~ 


(e) P(2<X<8)=P(X<8)-P(X<-2)= p(z<2*) 7 ey 
= PZ<-1)—-P(Z<-6) = 0.15866 


Example E3.55 


The compressive strength of samples of cement can be modeled by a normal distribution with a mean of 
7000 kg/cm? and a standard deviation of 100 kg/cm*. 


(a) find the probability of a sample’s strength that is less than 7250 kg/cm? 
(b) find the probability that a sample’s strength is between 6800 and 6900 kg/cm” 
(c) what strength is exceeded by 95% of the samples? 

SOLUTION: 

Refer to the table in Appendix-E. 


7250 —7000 
(a) P(X <7250)= of z < | = P(Z < 2.5) = 0.99379 
6800 — 7000 6900 — 7000 
(b) P(6800 < X < 6900) = p| eT <7 = P(-2< Z <=) 


= P(Z<-1)—P(Z<-2) =0.13591 


~7000 
(c) Pixaaj= P[z>2 cai ) =095 


x— 7000 ze 


Hence, =1.65. and x= 6835. 


(a) A process manufactures ball bearings whose diameters are normally distributed with mean 
3.505 cm and standard deviation 0.008 cm. Specifications call for the diameter to be in the interval 
3.5 + 0.01 cm. What proportion of the ball bearings will meet the specification? 
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(b) Suppose the process can be recalibrated so that the mean will equal to 3.5 cm, the centre of the 
specification interval. The standard deviation of the process remains 0.008 cm. What proportion of 
the diameter will meet the specifications? 


SOLUTION: 

Let X represents the diameter of a randomly chosen ball bearing. Then, X ~ N(3.505, 0.0087). Figure E3.56(a) 
shows the probability density function of the N(3.505, 0.008*) population. The shaded area represents 
P(3.49 < X < 3.51), which is the proportion of ball bearings that meet the specification. 


(a) We compute the z-scores of 3.49 and 3.51 as follows: 


gx 3-505 ge yee 3.505 gee 


0.008 : 0.008 

The area to the left of z =—1.88 is 0.0301. The area to the left of z = 0.63 is 0.7357. The area between z = 0.63 
and z = —1.88 is 0.7357 — 0.0301 = 0.7056. Hence, approximately 70.56% of the diameters of the ball bearings 
will meet the specifications. 

(b) The mean is 3.5000 rather than 3.505. The calculations are as follows: (see Fig. E3.56(b)). 

49-3. 3.51-—3.50 
ee 25795 and z =———_——- = 1.25 

0.008 0.008 

The area to the left of z =—1.25 is 0.1056. The area of the left of z = 1.25 1s 0.8944. The area between z = 1.25 
and z = —1.25 is 0.8944 — 0.1056 = 0.7888. Hence, recalibrating will increase the proportion of diameters that 
meet the specification to 78.88%. 


ZY | y, x - 


” 
4&j 


3.49 3.505 3.51 3.49 3.51 
z= -1.88 z= 0.63 z= -1.25 z=1.25 
Fig. E3.56(a) Fig. E3.56(b) 
Example E3.57 


EEE 
Assume X is normally distributed with a mean of 5 and a standard deviation of 1. Determine the value of x 
that solves each of the following: 


(a) PAX>x)=05 

(b) PX>x=0.95 

(c) Re <%as)=0.2 

(d) Pl-x<(X-5)<x]=0.95 
(e) Pl-x<(X-5)<x]=0.99 
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SOLUTION: 
(a) PX<x= of z — 105 


x-5 
| 


Therefore, =0andx=5S5, 


| =5 2 
(b) PX <x)= p(z>=—) = 1-P(z<**) = 0,95 


Therefore, rf z & | = 0.05 and 


x-5 
9) 


a 


=— 1.64 and hence x = 3.36. 


(Po 2 x 25) = p 2 <z<0] 209 


-5) = 
here (ore: P| Z< a EO oiadd a Sei seanditience : = 48 


(d) P[S-x<X<5+x]= p($<x <2) = 0,95 


Therefore, x/2 = 0.98 
and x= 1.96 
(e) PiS—x<X<5+x]=Pl-W/2 <X<.x/2]=0.99 
Therefore, x/2 = 1.79 
and x = 2.58. 


Example E3.58 
The diameter of a metal shaft for a precision instrument is assumed to be normally distributed with a mean 
of 0.5 mm and a standard deviation of 0.025 mm. 
(a) what is the probability that shaft diameter is greater than 0.31 mm? 
(b) what is the probability that shaft diameter is between 0.235 and 0.315 mm? 
(c) the diameter of 90% of samples is below what value? 


SOLUTION: 
0.315-—0.25 
(a) P(X>031)= A(z > | = P(Z > 3) = 0.00135. 
(hb) PO235<X<0315)=P06<2< 2.6) = P(Z < 2.6) - P(Z< 0.6) = 0.99534 — 0.27425 = 0.72109. 
x-0.25 

THO. Cas 1e\| 4 = 0.90 
Be <2) 0.025 

Hence, X= 0:25 |e} 

O02 


and x=0.282 
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Example E3.59 
The length of a metal rod used in a machine system is normally distributed with a length of 45.10 mm and a 


standard deviation of 0.05 mm. 
(a) what is the probability that a rod is longer than 45.15 mm or shorter than 49.85 mm? 
(b) what should the process mean be set as to obtain the greatest number of rods between 44.85 and 
45.15 mm? 
(c) if the rods that are not between 44.85 and 45.15 mm are scrapped, what is the yield for the process 
mean that one would select in part (b)? 


SOLUTION: 


45.15 —45.10 


) <Z |= P(l< Z) =1—0.084134 = 0.15866 
0.05 


(a) P(45.15< X)= Al( 


49.85 — 45.10 


= P(Z<-5)=0 
0.05 | me 


HOS ASS) = fz < 
Hence, the answer is 0.159. 
(b) The process mean should set at the centre of the specifications, that is, at p = 45.0 


4485-45. 45.15-—45.0 
(c) P(44.85< X <45.15)= jae. (_“*° 


= P(-3< Z <3) =0.9973 
0.05 0.05 


Example E3.60 


Refer to Example E3.59. If the process is centred so that the mean is 45 mm and the standard deviation is 
0.05 mm and that 10 rods are measured and are assumed to be independent, determine 

(a) the probability that all 10 rods measured are between 45.85 and 45.15 mm 

(b) the expected number of the 10 rods that are between 44.85 and 45.15 mm. 


SOLUTION: 
(a) P(44.85 < X < 45.15) = 0.9973 from Example E3.59. Therefore, by independence, the probability of 
10 rods are within the given limit is 0.9973" = 0.9733. 


(b) Let ¥ denotes the number of rods from the 10 selected that are within the given limits. Then, Y is 
binomial with n = 10 and p = 0.9973. 


Hence, BLY)=9.973. 


Example E3.61 


SS 


The weight of a electronic component is normally distributed with a mean of 6 ounces and a standard 
deviation of 0.25 ounce. 


(a) find the probability that the electronic component weighs more than 6.5 ounces. 


(b) what must the standard deviation of weight be in order for the company to state that 99% of its 
electronic components are less than 6.5 ounces? 
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(c) if the standard deviation stays at 0.25 ounces, what must the mean weight be in order for the 
company to state that 99.9% of its electronic components are less than 6.5 ounces? 


SOLUTION: 
6.5-—6 
(a7) P(X >65)= oz > =/(Z> 2)=0.02275 
0.25 
6.5-6 
(b) If P(X < 13) =0.999, then az < = 0.999 
Hence, — =3.09 
o=0.1618 


(c) If P(X <65) = 0,999, then n A z< 


= 0.999 


FP = 3.09 and p= 5.7275 


Therefore, te 


Example E3.62 
The weight of a mechanical component is normally distributed with a mean of 22 oz and a standard deviation 
of 0.5 oz. 
(a) what is the probability that a component weighs more than 23 oz? 
(b) what must the standard deviation of weight be in order for the company to state that 99.9% of its 
components are less than 23 oz? 
(c) if the standard deviation remains at 0.5 oz, what must the mean weight be in order for the company 
to state that 99.9% of its components are less than 23 0z? 
SOLUTION: 
Refer to the table in Appendix-E. 


=P(Z>2)=1-PZs2)=1 —0977250=0.022750: 


(ay P(X > 237 |e 


(b) If P(X <23)=0.999, then 


7 
P| z< 23-22 


2 = 0.999 


| 
=3.09ando= —— =0.324 


i 
Hence, = : 3.09 


23 
(c) If P(X <23) =0,999, then plz st |= 0,999 


Hence, ime = 3.09 
0.5 


or p= 21.455. 


144 // Probability and Statistics for Scientists and Engineers I 


Example E3.63 


hh ce EE 
Refer to the Example E3.56. Assume that the process has been recalibrated so that mean diameter is now 
3.5 cm. To what value must the standard deviation be lowered so that 95% of the diameters will meet the 


specifications? 


SOLUTION: 


The specification interval is 3.49 — 3.51 cm. We must find a value for 0 so that this interval spans the 
middle 95% of the population of ball bearing diameters, as shown in Fig. E3.63. The z-score that has 2.5% of 
the area to its left is z = —1.96. The z-score that has 2.5% of the area to its right is z = 1.96 (from the symmetry 
of the curve). It follows that the lower specification limit, 3.49, has a z-score of —1.96, while the upper limit 
of 3.51 has a score of 1.96. Either of these facts may be used to find o. 


Now Z=- 


3.51-—3.50 
oO 
or o = 0.005102 cm 


or 1.96 = 


95% 
3.49 3.50 3.51 
z= -1.96 z= 1.96 


Fig. E3.63 


Example E3.64 


An experiment needs a 2.41 cm diameter steel rod. Suppose that the diameter of a steel rod has a normal 
distribution with a mean of 2.41 cm and a standard deviation of 0.01 cm 


(a) determine the probability that a diameter is greater than 2.42 cm 
(b) what diameter is exceeded by 95% of the samples? 
(c) if the specifications require that the diameter is between 2.39 cm and 2.43 cm, what proportion of 
the samples meet specifications? 
SOLUTION: 
Refer to the table in Appendix-E. 
(a) Let X denotes the diameter of the steel rod. 
X ~ N(2.41, 0.01’) 
2.42-2.41 


P(X > 2.42) = 1-— P(X $2.42) =1- o| 
0.01 


= 1- @(1) = 0.1587 
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—2.41 
(b) PX&>x= P| z> ai = 0.95 
x-—2.41 
Hence, nar Ol = 1.645 and x = 2.3936 cm 


(c) P(2.39<X<2.43)= oe] id 0 2-24 


0.01 coe Oe 


3.5 | APPROXIMATING PROBABILITY DISTRIBUTIONS 


In some engineering quality control applications, it is quite useful to approximate one probability distribution 
by another. Figure 3.8 shows the approximation guide, that is, the conditions under which one distribution 
may be approximated by another. 


Hypergeometric 


Binomial 


np > 10 
n(1 —p)>10 
p< 1/2 


p< 0.1, n large 


Fig. 3.8: Approximation guide 


3.5.1 Binomial Approximation to the Hypergeometric 

When the ratio n/N is smail, that is, say n/N < 0.1, then the binomial distribution parameter p = D/N can be 
used as a good approximation to the hypergeometric distribution. The smaller the function n/N, the better 
the approximation. 


3.5.2 Poisson Approximation to the Binomial 
The Poisson distribution can be used as an approximation to the binomial distribution when the sample 
size n is large and the probability of success p is small (the same applies when q is small, p and q being of 
course interchangeable), i.e., when the binomial distribution is highly skewed. As a guide, we can say that 
a good approximation is obtained when n 2 20 and p < 0.05, and the approximation improves with a decrease 
in p. 

The advantage of using the Poisson distribution to approximate the binomial distribution is that the 
Poisson distribution, having only one parameter, is well tabulated. 

When the parameter is small, say, p < 0.1, and 7 is large, with A. = np constant, the Poisson distribution 
is used as an approximation to the binomial distribution. The larger the value of 1 and the smaller the value 


of p, the better the approximation. 
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3.5.3 Normal Approximation to the Binomial 


Recalling that if X ~ Bin (#7, p), then X= Y,+ Y, +--+ Y,, where Y, + Y, +--+ ¥, is a sample from a Bernoulli 
(p) population. Hence, X is the sum of the sample observations. 


The sample proportion is 
X Yrytety, 


| eae 
n n 


which is also the sample mean Y . The Bernoulli population has mean jt = p and variance o° = p(1 — p). It 
follows from the central limit theorem (see section 3.8) that if the number of trials is large, then 


X ~ N(np, np( — p)), and p ~ N(p, p( —pin)). 


In the binomial case, the accuracy of the normal approximation depends on the mean number of 
successes np and the mean number of failures n(1 — p). The larger the value of np and n(1 — p), the better 
the approximation. Usually, the normal distribution is used as an approximation to the binomial distribution 
when np and n(1 — p) are both greater than 5. A better and more conservative rule is to use the normal 
approximation whenever np > 10 and n(1 — p) > 10. 


Hence if X ~ Bin(n, p) and if np > 10 and n(1 — p) > 10, then 
X ~ N(np, np(1 — p)) approximately 


p~ nf p PP) approximately 
i” , 
Hence, if X is a binomial random variable with parameter n and p 


on 
Vnp(1— p) 


is approximately a standard normal random variable. 


If 7 is large, then one can justify the approximation of the binomial distribution by the normal distribution 
with mean np and variance np(1 — p). Noting that the binomial distribution is discrete and the normal 
distribution is continuous, we can make a continuity correction such that 


ea) of eit |_| eae 


Vnp(l- p) np(1— p) 


where ®(-) denotes the cdf of the standard normal distribution. We also write probability statements as 


Plas x cnao| Crtticre || ramet 


ynp(l— p) vnp(l- p) 


Although p should be about 0.5, the approximation can be used without excessive loss of accuracy for 


0.1 <p < 0.9. If p is close to 0.5 and n > 10, the approximation is fairly good. For other values of p, the value 
of n should be larger. 


(3.82) 


(3.83) 


In general, the approximation is good as long as np > 5 for p < 0.5 or when nq > 5 when p > 0.5. 
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The normal distribution is also used to approximate the sample fraction nonconforming, p. The random 


variable p is normally distributed with mean p and variance p(1 — p)/n, so that we have 


p[es% <2]. ps p<? 
nh n n n nt 


which indicates that we divide all terms in Eq.(3.83) by 7 to obtain 


b+1/2 . Caz 
7 ies © 
p{2<p<*)-0 ee 
n n p(l- p) p(l- p) (3.84) 
n n 


Continuity Correction: The binomial distribution is discrete, while the normal distribution is continuous. 
The continuous correction is an adjustment, made when approximating a discrete distribution with a 
continuous one that can improve the accuracy of the approximation. 


3.5.4 Normal Approximation to the Poisson 
If X is a Poisson random variable with E(X) = A and V(X) =A 


Paes 


VA 
is approximately a standard normal random variable. The approximation is good for A > 10. 


The normal distribution with mean Lt = 6° = A can be used as an approximation to the Poisson distribution 
if the mean A of the distribution is large, say 10 or more. 


The continuity correction used is given by 


Grek | 9 GO| 
ai Vi 


Example E3.65 2a——— 


The number of widgets made per shift is equal to 1000. It is known that 8% of them are nonconforming. A 
sample of 20 widgets is taken. What is the probability that 2 or fewer widgets in the sample are 
nonconforming? 


P(as X <b)= o| (3.85) 


(a) set up the equation using the hypergoemetric distribution 
(b) solve it using the binomial approximation 
(c) solve it using the Poisson approximation. 


SOLUTION: 
D=80, N= 1000, n = 20, p = 0.08 


wall 920 
2\ x /\20-x 
UCSD 16 O85 95 ee) eer ree, aa 
g = (oa 
20 
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2.20 . 
Cia =.) — >| |(o0s 02)" 
510) x 
2 5 2 20 9 
= i (0.08)° (0.92)° +| "| (0.08)'(0.92)'” +| ; (0.08)? (0.92)'® 
= 0.189 + 0.328 + 0.271 = 0.788 
(c) N=np=200.08) = 1.6 
P(x <2) =0.783 


Example E3.66 


If 20% of the electronic components made in a company are nonconforming, find the probability that in a 
random sample of 100 such electronic components selected and using normal approximation. 


(a) at most 16 will be nonconforming 
(b) exactly 16 will be nonconforming 
(c) between 16 and 22 will be nonconforming. 


SOLUTION: 


Normal approximation 
np = 1000.20) =20- Jnp(l— p) =4 


1620 
(a) Pxsl6)= | 2-2) = @(-0.809) = 1 — 0.809 = 0.191 
16.5= 20 be 20 
(b) Px=16)= | S22 |g BS=7 |. 0.191 — o(—1.125) = 0.191 — (1 — 0.855) = 0.046 


(PEON) ser) 
(c) Fee x= 22) — o| 5-2-9] 25-2 = (0.65) — 0.14 = 0.734 — 0.145 = 0.589 


Example E3.67 


Use the normal approximation to the binomial distribution to determine the probability of getting 7 heads 
and 7 tails in the 14 tosses of a balanced coin. 


SOLUTION: 


We must thus determine the area under the curve between 6.5 and 7.5.n = 14 andx=7, U= 1a al = 7 and 


o= fio{2)( 4} = 55 =180 
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Z, = —0.27 if Z, = 0.27 Number of heads 
6.5 7.5 
Fig. E3.67 
Sa / =_0.27 and z gees! oor 
lio. Cc oe 


The table for normal distribution in Appendix-E corresponding to z = —0.27 and 0.27 are 0.393580 and 0.60642 
and we find that the normal approximation to the probability of “7 heads and 7 tails” is 0.60642 — 0.393580 = 
0.2128. 


The table for binomial approximation in Appendix-C gives 0.2095. Hence, the error of the approximation is 


0.0033 
0.2095 — 0.2128 =-—0.0033 and the percentage error is ———— 0.2095 (100) = 1.5752% in the absolute value. 


Example E3.68 


Fifteen per cent of the parts produced in a manufacturing process fail a standardized quality test. 
(a) using the Poisson formula, find the probability that in a random sample of 100 parts which went 
through this test exactly 20 will fail. 


(b) using the Poisson probabilities table, find the probability that the number of parts which fail this 
test in a randomly selected 100 parts is 


(i) at most 9 
(ii) 10to16 
(iii) at least 20. 


SOLUTION: 
Let x denotes the number of parts in a random sample of 100 which fail the test. Since, on average. 15% of 
the parts fail the test, A = 0.15 x 100 = 15. 


(b) (i) ee ee ee 
= 0.0 + 0.0 + 0.0 + 0.0002 + 0.0006 + 0.0019 + 0.0048 + 0.0104 = 0.0194 + 0.0324 = 0.0697 
(ii) P(10to 16 fail) =PUO0<x< 16) = P(10) + P(11) + P(12) + P13) + P(14) + PLS) + PO6) 
= (1.0486 + 0.0663 + 0.0829 + 0.0956 + 0.1024 + 0.01024 + 0.0960 = 0.5942 
(iii) P(at least 20) = P(x 2 20) = P(20) + P(21) + P(22) + °° 
= 0.0418 + 0.299 + 0.0204 + 0.0133 + 0.0083 + 0.0050 + 0.0029 + 0.0016 + 0.0009 
+ 0.0004 + 0.0002 + 0.0001 + 0.0001 = 0.1249 
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Example E3.69 


If the probability is 0.10 that a certain mortgage company will refuse a loan application, use the normal 
approximation to determine the probability that the mortgage company will refuse at most 40 of 450 mortgage 
loan applications. 


SOLUTION: 
n= 450, p = 0.10, 1 = np = 4500.1) = 45, o = /np(— p) = 450(0.1)(0.9) = V40.5 = 6.3640 
on HES 
6.3640 


Referring to the table for normal distribution in Appendix-E corresponding to z = -0.7071, we find that the 
normal approximation to the probability is 0.22065. 


Hence, the probability that the mortgage company will refuse the load application is 0.22065. 


Example E3.70 


The number of nonconforming manufactured parts per shift is Poisson distributed, with a mean of 16. Find 
the probability that there are between 14 and 20 nonconforming parts on a shift that is 14, 15, 16, 17, 18, 19 
or 20. 

(a) use the Poisson distribution 


(b) use the normal approximation. 


SOLUTION: 
(a) A=15 
P[14 <x $20] = F(20) — F(13) = 0.917 — 0.363 = 0.554 
203-13) -4| 2 


Ol= 
(b) Pil4<x<20] 0) : 7 


= 0(1.375) — o(-0.375) = 0.9154 — 0.3538 = 0.5616 


Example E3.71 


Suppose we want to use the normal approximation to the binomial distribution to determine B(1; 150, 0.05) 
(a) is it justified in using the approximation? 


(b) make the approximation. 


SOLUTION: 


(a) Here n= 100, p =0.05 and mean = 150(0.05) = 75 and n(1 — p) = 150 — 7.5 = 142.5 
Yes. It is justified in using the approximation since np is > 5 and n(1 — p) is > 100. 
(b) w=7.5, 6 =np(1 —p) = 150(0.05)(0.95) = 7.125 or o = 2.67. 


0.5-7.5 1.5-7.5 
Z) = —— = -2.62 and Z 
> 6g em OT 
The entries in table for normal distribution in Appendix-E corresponding to z= —2.62 and z, = —2.25, we 


find that the normal approximation to the probability is 
0.012224 — 0.004396 = 0.007828 


iH} 

1] 

| 
i) 
) 
Nn 
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Example E3.72 
Refer to Example E3.71. Make the Poisson approximation. 
SOLUTION: 
-ry x ~7.5 ] 
Here A=7.5 and P(7.5) = ae a oe = (0,00055)7.5 = 0.0041 
Xx: I 
Example E3.73 


re 


In a survey it was found that 1.5% of the adult population in a large city suffer from drug addition. Of 100 
randomly selected adult population, let X denotes the number who have drug addiction problem. 


(a) what are the parameters for the appropriate normal distribution? 
(b) what are the parameters for approximating Poisson distribution? 
(c) compute the individual probabilities for the binomial distribution in part (a) 


(d) compute the individual probabilities for the Poisson distribution in part (b). Find the probabilities 
until they are zero to 4 decimal places. 


(e) compare the probabilities found in parts (c) and (d). 

(f) apply both the binomial probabilities and Poisson probabilities found in parts (c) and (d) to 
determine the probability that the number who suffer from drug addiction is exactly 3: between 2 
and 5 (inclusive); less than 4% of those surveyed; more than 2. Compare the two results in each 
case. 

SOLUTION: 

(a) Heren= 100 and p = 1.5/100 = 0.015 

(b) A=np=10000.015)=1.5 

(c) The binomial probability function is 


100 
P(X =x) = (0.015¥'(0.985)100~* forx=0, 1, 2, ..., 100 
is 


Starting at x = 0 and evaluating the function until it is zero to four decimal places, we obtain the following 
Table E3.73(a). 


Table E3.73(a) 


Pa 


0 
1 
2 
3 
4 
> 
6 
‘i 
8 
9 
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(d) The Poisson probability function is 


ilesie 
FCS: 7 leer ors) tle A ove 
x 


Starting at x = 0 and evaluating this function until it is zero to four decimal places, we obtain the following 
Table E3.73(b). 


Table E3.73(b) 


OMmMANMBRWN CO] 


(e) All the probabilities for x = 0 to 9 agree to 2 decimal places and are usually at most a couple of 
thousandths apart. The agreement is really good. 


(f) See Table E3.73(c). 
Table E3.73(c) 


Event Binomial Poisson 
probabilit probabilit 


Each pair of probabilities in the above Table E3.73(c) agree to within 0.0016 or less. 


3.6 CHEBYSHEV’S THEOREM | : 


The standard deviation 6 of a random variable X measures the weighted spread of the values of X about 
the mean ft. For smaller 6, we would expect that X will be closer to up. A more precise statement of this 
expectation is given by Chebyshev (1821-1894) which is stated as follows: 

Let X be a random variable with mean p. and standard deviation o. Then, for any integer k, the probability 


that a value of X lies in the interval [u — ko, + ko] is at least 1 — = , 


1 
Hence, Plu-kos X <prt+koj]= Saar (3.86) 


Proof: By definition 
oO = Var(X) = Xx, — )°p, (3.87) 
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Now, deleting all terms for which |x, — u| < ko and denoting the summation of the remaining terms by 
2*(x, — W)’p.. then, we have 


O° 2 2*(x, — w)’p, 2 L* Po’, = Po’X*p, = Ro*p(|X - | > ko) (3.88) 
or = Ko"[1 — p(X - | < ko)] = Ro"[1 — P(u-ko <X< u+ ko)] (3.89) 
when o > 0, and dividing Eq. (3.89) by k’0”, we get 
1 
pee oS 8 S10 t ka) (3.90) 
or Pluk s X Sp+ko) <1-— (3.91) 


Equation (3.91) is the Chebyshev’s inequality for o > 0. 
If o = 0, then x, = p for all p, > 0, and we have 


PIU-KO)SXS p+ KO] = PK =I) =1>1-— (3.91a) 


Chebyshev’s theorem gives a lower bound for the area under a curve between two points that are an 
opposite sides of the mean and at the same distance from the mean. Chebyshev’s theorem is stated as 


| 
follows: For any number k greater than 1, at least [ = *) of the data values lie within & standard deviations 


of the mean. Chebyshev’s theorem is illustrated in Fig. 3.9. 


At least 1 — 1/k’ of 
the values lie in 
the shaded area 


Fig. 3.9: Chebyshev’s theorem 


For k= 2 


1 
ae ao = 0.75 or 75% 


That is, at least 75% of the values of a data set lie within two standard deviations of the mean as 


shown in Fig. 3.10. 
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At least 75% of 
the values lie in 
the shaded area 


u— 20 uu +20 


Fig. 3.10: Percentage of values within two standard deviations of the mean 
Fork =3 


| ] 
be tae SOO Soe 
ko 3 
When k = 3, this inequality shows that regardless of the assumed distribution of X, the probability is 
no more than 1/3? = 1/9 that the random variable takes on a value more than three standard deviations away 


from its mean (see Fig. 3.11). 


At least 89% of 
the values lie in 
the shaded area 


p—3o rm U+3o 


Fig. 3.11: Percentage of values within three standard deviations of the mean 


Chebyshev’s inequality is applicable to both sample and population data. Chebyshev’s inequality is 
also applicable to distribution of any shape. It should be noted here that Chebyshev’s inequality can be 


used only for k > 1 (since when k = 1, 1 — * = (0 and when k< 1, 1 —- = is negative), 


Example E3.74 


enn er reser rence nee SSS SSS 
The mean and standard deviation for the final examination scores in Economics course are 80 and 7.5 
respectively. Determine the percentage of students who scored between 65 and 95, using Chebyshev’s 


theorem. 
SOLUTION: 
From the given data 
Mean u = 80 and standard deviation o = 10. 
Each of the two points, 60 and 95, is 15 units away from the mean. 


Hence, pa PEP, 


o 75. 
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1 
a 0.75 or 75% 
k° 28 


That is, at least 75% of the students scored between 60 and 95 as shown jn Fig. E3.74. 


Scores of at least 75% of 
the students fall in the 
shaded area 


lL —20 wu U+2o 


Fig. E3.74: Percentage of students who scores between 60 and 95 


Example E3.75 

A random variable X has mean pt = 35 and standard deviation 6 = 2. Apply Chebyshev’s inequality to estimate 
(a) P(X<45) 
(b) PX230) 

SOLUTION: 

Chebyshev’s inequality states: 


Pit — kos Xx Sj+ko)21-~5 


(a) Substituting p = 35 and o = 2 in + ko, we have 
Soe 4 Or ko 


Hence, 1-—= la 08 
5 


Similarly, p — ko = 35 — 10 = 25 
Chebyshev’s inequality gives 
P(25 < X < 45) 2 0.96. 
The event corresponding to X < 45 contains as a subset the event corresponding to 25 < X < 45. 
Hence 
P(X < 45) = P(25 < X < 45) 2 0.96 
Therefore, the probability that X is less than or equal to 45 is at least 96% 
(b) Substituting = 35 and o = 2 in p + ko, we have 
35 — 2k = 30 fork or k= 2.5 


(aie 


ke? as 
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Since p. + ko = 35 + 5 = 40, Chebyshev’s inequality gives 
P(30< X < 40) 2 0.84 

The event corresponding to X > 30 contains as a subset the event corresponding to 30 < X < 40. Hence 
P(X = 45) = PG0 < X < 40) 2 0.84 

Thus, the probability that X is greater than or equal to 30 is at least 84%. 


Example E3.76 


A random variable X has mean wt = 85 and standard deviation o = 5. 
(a) what inferences can be made from Chebyshev’s inequality for k = 2 and k = 3? 
(b) estimate the probability that X lies between 65 and 105. 


(c) determine an internal [a, b] about the mean for which the probability that X lies in the interval is at 
least 99%. 


SOLUTION: 
(a) We set k=2 and obtain p — ko = 85 — 2(5) = 75 and p + ko = 85 + 2(5) = 95. 
We can therefore conclude from Chebyshev’s inequality that the probability that a value of X lies 


between 75 and 95 is at least 1— = = (07/5. 


PUS SX%295): 20:75 


1 
If k = 3, we find that the probability that X lies between 70 and 100 is at least 1—- oa = 8/9 = 0.889 


(b) Here ko = 20 since 85 — 20 = 65 and 85 + 20 = 105 or k(5) = 20 or k= 4. 
Hence, by Chebyshev’s inequality 


P(65 <X <105)> i= = 0.94 
Thus, the probability X lies between 65 and 105 is at least 94%. 
(c) Setting as = 0.99, we get 1 — 0.99 = 1/k* or k= 10. 
Hence the interval is [85 — 10(5), 85 + 10(5)] = [35, 135]. 


3.7 EMPIRICAL RULE : | | 


If the distribution of the data is approximately bell-shaped, then empirical rule applies. The empirical rule is 
stated as follows: 


For a bell-shaped distribution, approximately 
1. 68% of the observations lie within one standard deviation of the mean. 
2. 95% of the observations lie within two standard deviations of the mean. 
3. 99.75% of the observations lie within three standard deviations of the mean. 
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The empirical rule is illustrated in Fig. 3.12. The bell-shaped distribution, also known as the normal 


distribution has been described earlier. The empirical rule is applicable to population data as well as sample 
data. 


u-30 pw-20 u-Go W+O w+2o0 p+3o 


0 
Fig. 3.12: Empirical rule 


Example E3.77 


The weight distribution of a sample of 10,000 male students in a particular university is bell-shaped with a 
mean of 40 kg and a standard deviation of 5 kg. Find the percentage of students who weigh approximately 
30 to 50 kg. 
SOLUTION: 
From the given data, we have for the sample 

Mean x = 40 kg and standard deviation s = 5 kg. 
Each of the two points, 30 and 50, is 10 units away from the sample mean of 40 kg. Dividing 10 by the 


1091 
standard deviation (5 kg), we have — = 7 = 2. The distance between 30 and 40 and between 40 and 50 kg 
Oo 


is each equal to 2s as shown in Fig. E3.77, the area from 30 to 50 kg is the area from x —2s to X¥ + 2s. 
Applying the empirical rule, approximately 95% of the male students in the sample weigh between 30 and 
50 kg. 


30 — 40 = -10 50 —40 = 10 


=—2s | =2s 


Ages 


Fig. E3.77: Percentage of male students who weigh between 30 and 50 kg 


3.8 . THE CENTRAL LIMIT THEOREM 


Many commonly used statistical methods depend on the central limit theorem. The central limit theorem 1s 
very important result in statistics. The central limit theorem states that if a large enough sample is drawn 
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from a population, then the distribution of the sample mean is approximately normal, no matter what 
population the sample was drawn from. If the variable is normally distributed, so is the variable mean, x . 
This key fact also holds approximately if x is not normally distributed, provided only that the sample size is 
relatively large. 

Let X,, X,, ....,X, be a simple random sample from a population with mean pt and variance o. 


eee x 


lect X= ” be the sample mean. 


Hl 


Let S =X, +X,+-+:: +X, be the sum of the sample observations. 
Then, if n is sufficiently large, 


5. 


X oN | | approximately 
n 


and S_~ N(np,, no’) approximately 


5 


ce 


aie Oo : 
The statement of the central limit theorem specifies that wu. and o. = — ., which hold for any sample 
n 


mean. The sum of the sample items is equal to the mean multiplied by the sample size, that is, S,, = nx . It 
follows that us = nt and Of =O iane.. 


Hence, for a relatively large sample size, the variable is approximately normally distributed, regardless 
of the distribution of the variable under consideration. The approximation becomes better with increasing 
sample size. 

The further the variable under consideration is from being normally distributed, the larger the sample 
size must be for a normal distribution to provide an adequate approximation to the distribution of x. 
Generally, however, a sample size of 30 or more (n 2 30) is large enough. 

The proof of the central limit theorem is difficult and very complex to be included here. 

Summarising, the central limit theorem says that if X , X,, -.., X, is a random sample of size n taken from 


a population (either finite or infinite) with mean yp and finite variance 0’, and if X is the sample mean, the 
limiting form of the distribution of 


X—p 
o/Jn 


as n — ©, is the standard normal distribution. The normal approximation for X depends on the sample size 
n. In many cases of practical interest, if n 2 30, the normal approximation will be satisfactory regardless of 


the shape of the population. If n < 30, the central limit theorem will work if the distribution of the population 
is not severely non normal. 


Example E3.77 


A mechanical component has tensile strength that is normally distributed with mean 75.5 MPa and standard 
deviation 3.5 MPa. Find the probability for a random sample of n = 6, these component specimens will have 
sample mean tensile strength that exceeds 75.75 MPa. 


Jb = 
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SOLUTION: 


Given Uz =75.5 MPa 
Therefore, oO; = == =1.429 


— x_ 
rE 21575) = P| :) = P(Z>0.175) = 1 -P(Z<0.175) 
o/Vn 


= 1 — 0.56945 = 0.43055 


Example E3.78 


A normal population has mean 80 and variance 25. How longer must the random sample be if we want the 
standard error of the sample average to be 1.5? 


SOLUTION: 
Given o* = 25 oro =5 and Ox =1.5 
oO 5 
Oy = =H =1S 
. Vn Hl 
2 2 
or eye -() Sid titer12 


PROBLEMS | 


Section 3.1 Random Variables 
P3.1 The frequency distribution of the number of orders received per day by a distribution company 
during the past 3 months (90 days) 


Number of orders received per da 
Number of days 


(a) construct a probability distribution table for the number of orders received per day 


(b) are the probabilities in (a) exact or approximate? 
(c) if x denotes the number of orders received on any given day. calculate the following 
probabilities 
(i) Px=3) 
(1H) RG 3) 
(iii) P2<Sx<4) 
(iv) P(x<4). 
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P3.2 


P3.3 


P3.4 


P3:3 


The following table gives the probability distribution of a discrete random variable x. 


0.14 | 0.18 | 0.15 | O20) 0.OORnGs 


Find the following probabilities: 


(a) P(x=4) 
(b) P(x>3) 
(c) PXx<3) 


(dj) PSxs5) 


The following table gives the probability distribution of automobiles sold on a given day in an 
automobile dealership. 


Automobile sold] O [1 [2 [3 __| 
Probabilit 0.05 | 0.10 | 0.20 [ 0.30 


Calculate the mean and standard deviation for this probability distribution. 


Table P3.4 gives the probability distribution where X represents the number of times the 
manufacturing process is recalibrated during a week whenever the quality of the component 
produced falls below certain specifications. Assume that X has the probability mass function as 
given in Table P3.4. 


(a) determine the mean of X 


(b) determine the variance and standard deviation for the random variable X. 


Table P3.5 shows the probability distribution of the number of breakdowns per week for a machine. 


| Breakdowns perweek]0_ | 1 [2 [3 | 
0.30 


(a) represent the probability graphically 

(b) find the probability that the number of breakdowns for this machine during a given week is 
(i) exactly 2 
(ii) Oto2 
(iii) more than 1 


(iv) at most 1 


P3.6 


P37 


P3.8 


P3.9 


P3.10 
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For each of the following, indicate whether it is or is not a random variable. 
(a) the number of automobiles sold at a dealership during a given month 
(b) the height of a person 

(c) the number of heart beats per minute 

(d) the price of an automobile 

(e) the time needed to complete a written examination 

(f) the number of automobiles a household owns 

(g) the number of tails obtained in four tosses of a balanced coin. 


The number of telephone calls received in a particular sales office between 10:00 am and 11:00 am 
has the probability function given in Table P3.7. 


Table P3.7 
Number of telephone calls received, x 
(a) is this a probability function? 
(b) find the probability that there will be three or more calls 


(c) find the probability that there will be an even number of calls. 


Table P3.8 gives the probability distribution of x, where x denotes the number of defective 
mechanical components produced in a manufacturing company. 


Table P3.8 


Determine the mean and standard deviation of x. 
Table P3.9 gives the number of telephone calls received per hour in an office and the distribution. 


Table P3.9 


Number of telephone calls received, x 


Determine the mean and standard deviation of the number of telephone calls received per hour in 
that office. 


Table P3.10 gives the results of the experiment of rolling a die with the discrete random variable 
number of dots. 
Table P3.10 


a ao | 


Determine the mean and variance of that random variable x. 
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Pei 


B32 


P33 


P3.14 


P3.15 


P3.16 


P37 
P3.18 


The time to failure of an electronic component is a continuous random variable known to have the 
density function 0.5e’ where f is in years. What is the probability that this component will fail 
within the first year of operation? 


Find the mean u for the probability density function, p(x), of the life of a projector bulb, random 
variable x, is given as: 


0 One) 
aud P=) 1900 6! + for x20 


The density function for a continuous random variable x is given as: 
f(&) =0.25(@ — 2) for2<x<5 
Sketch the density and distribution functions. 
The daily consumption of an electric power of a certain machine (in units of power) is a random 
variable whose probability density is given by: 
ox" = for x > 0 


fO)= } : for x <0 


Determine the probabilities that on a given day 
(a) the consumption of this machine is no more than 6 units 
(b) the power supply is inadequate in the daily capacity if the supply is 9 units. 
The total lifetime (in years) of a certain machine is a random variable whose distribution function 
is given by: 

' 0 for x <5 

f(x) = : 
be 2) 2s forx>5 

Find the probabilities that such a machine system will have life 
(a) beyond 10 years 


(b) less than 8 years 
(c) anywhere from 12 to 15 years. 


The probability density function of X is given by 


125—-x‘); O<x<1 
0) 2 otherwise 


f(x)= | 


where the random variable X denotes the clearance (in mm). The clearance is the difference between 
the radius of the hole drilled in a flat sheet-metal plate and a shaft inserted through the hole. 


Components with clearances larger than 0.8 mm are to be scrapped. What proportion of components 
are scrapped? 


Refer to Problem P3.16. Determine the mean clearance and the variance of the clearance. 
Determine the mean and variance for the function f (x) 


\ 
f(x)=}b-a 
0 


SS eSip 
; otherwise 


where X is the uniform random variable on the interval [a, b]. 
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P3.19 


P3.20 


me certain radioactive mass emits alpha particles from time to time. The time between emissions 
(in seconds) is random with the probability density function given by 


Ole x= 0 


2 SQ 


f(xy= 


Determine the median time between emissions. 


Determine the mean and variance for the following continuous random variable X with probability 
density function f(x) given by 
l —x/ 4 
pach > x20andA>0 
F)=4X 


0 ; otherwise 


Section 3.2 Permutations and Combinations 


P3.21 (a) Determine the number of permutations of the 26 letters of the alphabet. 
(b) Determine the number of ways of 6 distinct products that can be lined up on a display shelf. 

P3.22 A committee is to be formed consisting of 2 men and 2 women. There are 6 men and 7 women 
qualified and available to fill this position. How many different committees can be formed out of 
the finalists? 

P3.23 Develop a table of the number of possibilities when 2, 3, 4 and 5 letters of the alphabet are used. 
Generate the possibilities. English alphabet letters are used with and without repetition. 

P3.24 How many 5 digit numbers can be formed with the 10 digits 0, 1, 2, 3, ..., 9 if 
(a) repetitions are allowed 
(b) repetitions are not allowed 
(c) the last digit must be zero and repetitions are not allowed. 

P3.25 How many ways can an executive committee of 5 can be chosen from a pool of 15 members? 

P3.26 A company wants to purchase 4 electronic systems. After all the system models were reviewed, 8 
foreign made and 10 U.S. made systems were considered to satisfy all the security requirements 
for the company. 
(a) if the systems are chosen at random, find the probability that 2 of the systems selected are 

foreign made. 
(b) what is the probability that all the four systems selected are U.S. made? 
(c) what is the probability that all of the 4 systems selected are foreign made? 
(d) what is the probability that at least 2 of the systems are U.S. made? 
Permutations 

P3.27_ A pianist knows five pieces but will have enough stage time to play only four of them. Pieces 
played in a different order constitute a different program. How many different programs can be 
arranged? 

P3.28 There are six persons on a sinking boat. There are four life jackets on board. How many combinations 


of survivors are there? 
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P3.29 


P3.30 


P3.31 


P3.32 


P3.33 


P3.34 


P3.35 


P3.36 


P337 
P3.38 
P3339 
P3.40 


A violinist knows four pieces but will have enough time to play only three of them. Pieces played 
in a different order constitute a different program. Determine the number of different programs that 


can be arranged. 

A bin contains 5 white balls, 2 red balls, and 3 green balls. Find the probability of getting either a 
white ball or a red ball in one drawn from the bin. 

Five defective parts were unintentionally mixed with 45 non-defective (good) ones. After a thorough 
mixing, 5 parts are picked simultaneously from the collection of 50. 

(a) determine the probability that all 5 parts selected are non-defective ones 

(b) what is the probability that no more than 1 part selected is defective? 

(c) what is the probability that all 5 parts selected are defective? 


From 12 items, in how many ways can a selection of 6 be made? 

(a) when a specified item is always included 

(b) when a specified item is always excluded. 

Out of 6 manufacturing engineers and 9 design engineers, a committee consisting of 3 manufacturing 
engineers and 5 design engineers is to be formed. In how many ways can this be done if: 

(a) any manufacturing engineer and any design engineer can be included 

(b) one particular design engineer must be on the committee 

(c) two particular manufacturing engineers cannot be on the committee. 

Determine the number of ways of splitting 10 persons into 2 groups containing 4 and 6 persons 
respectively. 

A committee of 2 men and 3 women needs to be formed out of 7 men and 7 women. Determine the 
number of ways this can be done if 

(a) any man and any woman can be included 

(b) one particular woman must be on the committee 

(c) two particular men cannot be on the committee. 

Three cards are drawn in succession from a deck of 52 cards. Determine the number of ways this 
can be done 

(a) with replacement 

(b) without replacement. 


Determine the number of committees of 3 that can be formed from 8 persons. 
Determine the number of combinations of 4 taken 3 at a time. 
Find the number of ways of selecting two objects out of four objects. 


If 3 persons are to be selected randomly from 5 persons for a committee, determine the different 
possible combinations. 


Section 3.3 Discrete Distributions 


P3.41 


A lot contains 140 electronic components and 20 are selected without replacement for quality testing. 
(a) if 20 components are defective, what is the probability that at least one defective component 
is in the sample? 


(b) if 5 components are defective, what is the probability that at least one defective component 
appears in the sample? 
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P3.42 A lot of mechanical parts contains 24 in a package. It was determined to test 4 parts in each package 
and if all 4 parts pass the tests, then they are all accepted. If in a package of parts in which 3 are 
non-conforming, what is the probability of rejection of these parts? 


P3.43 Adie is rolled three times. Find the probability of getting one 4 in the three rolls. Also find the 
probability of getting two 4’s, three 4’s, and no 4’s in three rolls. 


P3.44 In a production process the defective rate is 15 per cent. Assuming a random sample of 10 items is 
drawn from this process, find the probability that two of them are defective. 


P3.45 Batches of 50 shock absorbers from a production process are tested for conformance to quality 
requirements. The mean number of non-conforming absorbers in a batch is 5. Assume that the 
number of non-conforming shock absorbers in a batch, denoted as x, is a binomial random variable. 
(a) find n and p 
(b) find p(s 2) 

(c) find p(x = 49). 


P3.46 A production process manufactures certain mechanical components for a machine system. On 
average, 1.5% of the components will not perform up to specifications. When a shipment of 100 
components is received at the plant, they are tested, and if more than 2 are defective, the shipment 
is returned to the manufacturer. What is the probability of returning a shipment? 


P3.47 A quality test engineer claims that | in 10 of certain manufactured parts is due to material defects. 
Using binomial distribution and rounding to four decimals, find the probability that at least of 3 
out of 5 of the tested parts are due to material defects. 


P3.48 At the ABC House Delivery Service, providing high-quality service to its customers is the top 
priority of the company. The company guarantees a refund of all charges if a package it is delivering 
does not arrive at it’s destination by the specified time. It is known from past data that despite all 
efforts, 3% of the packages mailed through this company do not arrive at their destinations within 
the specified time. A corporation mailed 10 packages through ABC House Delivery Service on 
Monday. 

(a) find the probability that exactly one of these 10 packages will not arrive at it’s destination 
within the specified time. 

(b) find the probability that at most one of these 10 packages will not arrive at it’s destination 
within the specified time. 


P3.49 Five per cent of a large batch of high-strength steel components purchased for a mechanical system 


are defective. 

(a) if seven components are randomly selected, find the probability that exactly three will be 
defective 

(b) find the probability that two or more components will be defective. 


P3.50 The number of customers arriving a bank in the next period is a Poisson distribution having a 
mean of eight. Find the probability that exactly six customers will arrive in the next period. 


P3.51 If the probability of a concrete beam failing in compression is 0.05, use the Poisson approximation 
to obtain the probability that from a sample of 50 beams 
(a) at least three will fail in compression 
(b) no beam will fail in compression. 
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P3:53 


P3.54 


R355 


P3.56 


| eo! 


P3:58 


P3893 


P3.60 


The number of defects on an electronic component which is used in a computerized system has 
been found to follow the Poisson distribution with A = 3. Find the probability that a randomly 
selected electronic component will have two or less defects. 

The number of telephone calls made to a certain company’s operator is a Poisson random variable 
with a mean of 5 calls per hour. 

(a) what is the probability that 5 calls are received in one hour? 

(b) what is the probability that 10 calls are received in 1.5 hour? 

(c) what is the probability that less than 2 calls are received in 1 — 1/2 hours? 

A photocopying machine in an office breaks down an average of three times per month. Using the 
Poisson probability distribution formula, find the probability that during the next month this machine 
will have 

(a) exactly two breakdowns 

(b) at most one breakdown. 

The proportion of mechanical manufactured parts that are non-conforming is 0.04. Obtain the 


Poisson approximation to the binomial distribution for the probability of three or fewer non- 
conforming parts in a sample of 100. 


An examination consists of five questions, and to pass the examination a student has to answer at 
least four questions correctly. Each question has three possible answers, of which only one is 
correct. If a student guesses on each question, what is the probability that the student will pass 
the test? 


The probability that a person undergoes a heart operation will recover is 0.6. Determine the 
probability that of the six patients who undergo similar heart operation: 

(a) none will recover 

(b) all will recover 

(c) half will recover 

(d) at least half will recover. 

In a certain manufacturing process, it was found from quality control inspection that 20% of the 
machine components produced by the process are defective. Determine the probability that out of 


4 machine components selected at random (a) 1, (b) 0, (c) at most 2 machine components will be 
defective. 


Given that the probability of an individual patient suffers a bad reaction from injection of a particular 
serum is 0.001. Determine the probability that out of 2000 individual patients. 
(a) exactly 3 individuals will suffer a bad reaction 


(b) more than 2 individuals will suffer a bad reaction. Use Poisson distribution. 


Use binomial distribution and repeat Problem 3.59. 


Section 3.4 Continuous Probability Distributions 


P3.61 


P3.62 


The inside diameter of a finished shaft of uniform diameter is normally distributed with a mean of 


4.50 cm and a standard deviation of 0.01 cm. What is the probability of obtaining a diameter 
exceeding 4.52 cm? 


The resistance of a foil strain gauge is normally distributed with a mean of 100 ohms and a standard 


deviation of 0.8 ohm. The specification limits are 100 +1.0 ohms. What percentage of gauges will 
be defective? 


P3.63 


P3.64 


P3.65 


P3.66 


P3.67 


P3.68 


P3.69 


P3.70 
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The measurement of the diameter of a special steel pipe is normally distributed with a mean of 
5.01 cm and standard deviation of 0.03 cm. The specification limits are 5.00 +0.05 cm. What 
percentage of pipes is not acceptable? 


The yield strength of metal component manufactured is N(192.36). A purchaser of the metal 
components requires strength of at least 180 psi. The probability that a metal component will meet 
or exceed the specifications is given by P(x > 180). Determine the probability that a metal component 
meets or exceeds the specifications. 


Find the probability that the yield strength of the metal components in Problem P3.6+4 is between 
180 and 198 psi. 


The diameter of a machine shaft produced in a manufacturing company is normally distributed 
with a mean diameter of 0.001 inches and a standard deviation of 0.0002 inches. 


(a) what is the probability that the diameter of the shaft exceeds 0.0013 inches? 

(b) what is the probability that a diameter of the shaft is between 0.0007 and 0.0013 inches? 

(c) what standard deviation of diameters is needed so that the probability in part ()) is 0.995? 
The mass, m, of a particular machine part is normally distributed with a mean of 66 kg and a standard 
deviation of 5 kg. 

(a) what percentage of the parts will have a mass less than 72 kg? 

(b) what percentage of parts will have a mass in excess of 72 kg? 

(c) what per cent of the parts will have a mass between 61 and 72 kg? 

It was determined experimentally that the load X required to break a plate is normally distributed 
with mean 2.5 and standard deviation 0.24. Determine the probability that 

(a) the plate breaks at a load of 2.61 or less 

(b) the plate breaks at a load of more than 2.39 

(c) the breaking load is at least 2.86 

(d) the breaking load is in the interval (2.61, 2.86). 

Use the Table in Appendix-E to determine the following probabilities for the standard normal random 
variable z. 

(a) P(e <30) 

(b) P(z>-2.15) 

(c) P(-3<z<3) 

(d) PO<z<1) 

(2) Fiz 3) 

Assume z has a standard normal distribution. Use Appendix-E table to determine the value for < 
that solves each of the following: 

(a) P(Z<z=05 

(b) P(Z>2z=09 

(c) Pl-2z<Z<z=0.99 

(d) P(-z<Z<z)=0.68 

(e) P(-1.24<Z<z)=08 
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P3.72 


P3738 


P3.74 


The diameter of a component in a machine system is normally distributed with mean 0.2508 cm and 
standard deviation 0.0005 cm. The specifications on the component are 0.25 +0.0015 cm. Determine 
the proportion of components confirms to specifications. 


The mass, UL, of a particular electronic component is normally distributed with a mean of 66 g and a 
standard deviation of 5 g. Determine 

(a) the per cent of components that will have a mass less than 72 g 

(b) the per cent of components that will have a mass in excess of 72 kg 

(c) the per cent of components that will have a mass between 61 and 72 g. 


Customers buying copper rods supplied by a certain manufacturer require that the rods be between 
9.9 cm and 10.5 cm, inclusive. The manufacturing process is such that the actual rod lengths are 
well approximated by a normal distribution with mean 10.1 cm and standard deviation 0.20 cm. 
Determine the percentage of the manufacturer’s production is acceptable to the customer. 


Refer to Problem P3.73 and determine what rod length is exceeded by 95% of the manufacturer’s 
product. 


Section 3.5 Approximating Probability Distributions 


P3.73 


P3276 


E377 


P3.78 


P3728 


P3.80 


P3.81 


In a production lot of 100 components, five of them are found to be non-conforming. Approximate 
the probability that a random sample of 10 components contains no more than 1 non-confoiming 
component (hypergeometric distribution) by the binomial distribution. 


The proportion of components manufactured that are non-conforming is 0.04. The probability of 
three of fewer non-conforming components in a random sample of 100 is given by the binomial 
distribution. Make an approximation using Poisson distribution. 


Out of a batch of 2800 electronic components received 25% of them were defective or non- 
conforming. In a sample of 50 randomly selected, the probability that between 12 and 14 are given 
by the binomial distribution. Use normal distribution to this binomial distribution. 


A lot of electronic components are known to contain p = 0.12 non-conforming. In a sample of 100, 
the probability that the fraction non-conforming is 0.10 < p < 0.20 is given by the normal distribution. 
Make an approximation. 


The number of telephone calls received in a specified time by an operator in a manufacturing 
company is Poisson distribution with mean A = 14. Determine the probability that between 10 and 
18 calls are received by the operator in that specified time using normal approximation. 


Determine the probability that in a sample of 10 machine components chosen at random, exactly 
two will be defective by using 

(a) the binomial distribution 

(b) the Poisson approximate to the binomial distribution. 


Given that 10% of the machine components produced in that manufacturing process are defective. 


In some manufacturing process that a production of 200 components contains 5 components that 
do not meet the quality specifications. Determine the probability that a random sample of 10 


components will contain no non-conforming components. Use binomial approximation to the 
hypergeometric. 
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P3.83 


P3.84 


P3.85 


P3.86 


P3.87 


P3.88 


P3.89 


P3.90 


P3i91 


P3.92 


Assume that an examination has 10 questions of the type true or false. If the student taking such 
an examination guesses at all 10 questions, determine the probability that the student answers 
either seven or eight questions correctly. 


(a) Ifa fair coin is tossed 100 times, use the normal curve to approximate the probability that the 
number of heads is between 35 and 45 inclusive. 


(b) Ifa fair coin is tossed 100 times, use the normal curve to approximate the probability that the 
number of heads is between 35 and 45 exclusive. 


In a certain university, 25% of the students are over 21 years of age. In a sample of 400 students, 
what is the probability that more than 110 of them are over 21 years of age? 


A soft drink company conducted a taste survey marketing a new soft drink. The results of this 
survey showed that 70% of the people who tried the drink liked it. Encouraged by this result, the 
company decided to market the new soft drink. Assume that 70% of all people like this drink. On a 
certain day, 100 customers bought this soft drink. 

(a) find the probability that exactly 65 out of 100 customers will like this drink 

(b) find the probability that exactly 60 or less of the 100 customers will like this drink 

(c) find the probability that exactly 75 to 80 of the 100 customers will like this drink 


The length of a bolt manufactured in a certain manufacturing process has a mean of 50 mm and 
standard deviation of 0.45 mm. Determine the largest possible value for the probability that the 
length of the bolt is outside the interval 49.1 — 50.9 mm. 


In a mechanical engineering design class, the mean for the final examination scores is 75 and the 
standard deviation is 5. Using Chebyshev’s theorem, find the percentage of students who scored 
between 60 and 90. 


Given that X is a random variable with the M(u, o*) distribution. Denoting the standardised X by Z, 
make the comparisons between the actual and the Chebyschev’s bounds for Z = 1, 2, and 3(k = I, 
2, and 3). 

In mechanics class, the mean for the midterm scores is 65 and the standard deviation is 8. Using 
Chebyshev’s theorem, find the percentage of students who scored between 49 and 81. 


The 2007 gross sales of all firms in a large city have a mean of 3.3 million and a standard deviation 

of 0.6 million. Using Chebyshev’s theorem, find at least what percentage of firms in this city had 

2007 gross sales of 

(a) $2.1 to $4.5 million 

(b) $1.8 to $4.8 million 

(c) $1.5 to $5.1 million 

(a) Let X be arandom variable with mean Lt = 50 and standard deviation o = 6. Use Chebyshev’s 
inequality to find a value for b for which P(50-bS XS 50 + b) 20.95. 

(b) Let X be a random variable with mean pt = 70 and unknown standard deviation 6. Use 
Chebyshev’s inequality to find a value for o for which P(65 < X < 75) 2 0.90. 


The age distribution of a sample of 6000 persons is bell-shaped with a mean of 50 years and a 
standard deviation of 10 years. Determine the approximate percentage of people who are 30 to 70 


years old. 
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P3.94 


P3.95 


P3.96 


P3:97 


P3.98 


A manufacturing company manufactures steel rods of 100 cm length and a standard deviation of 
10 cm. The distribution of the rod is normal. Find the probability that a random sample of n = 25 
rods will have an average length less than 95 cm. 


The following table gives the probability mass function of X, where X denotes the number of defects 


in a 1 m length of aluminium wire. One hundred wires are sampled from this population. Find the 
probability that the average number of defects per wire in this sample is less than 0.5. 


Suppose that a random sample of size n = 12 is taken from the uniform distribution on the interval 


[0, 1]. Determine the value of P | a B 


i so), 


ya) 


Suppose that a random variable X has a continuous uniform distribution 


l/2e; Aaa 


QO ; otherwise 


f(x)= | 


Find the distribution of the sample mean of a random sample of size n = 50. Use central limit theorem. 


At a large university, the mean age of the students is 21.3 years, and the standard deviation is 
4 years. A random sample of 64 students is drawn. What is the probability that the average age of 
these students is greater than 22 years? 

The GPAs of all 5540 students enrolled at a university have an approximate normal distribution 
with a mean of 3.02 and a standard deviation of 0.29. Let x be the mean GPA of a random sample 


of 48 students selected from this university. Determine the mean and standard deviation of ¥ and 
comment on the shape of its sampling distribution. 


REVIEW QUESTIONS 


iF 


Explain the meaning of probability distribution of a discrete random variable. What are the various 
ways to present the probability distribution of a discrete random variable? 


Briefly explain the two-characteristics (conditions) of the probability distribution of a discrete 
random variable. 


Define the terms mean and standard deviation of a discrete random variable. 
Briefly explain the following: 

(a) binomial experiment 

(b) trial 

(c) binomial random variable 

Describe the parameters of the binomial probability distribution. 

Define the following: 


(a) n-factorial (b) combinations 
(c) permutations 
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7. What are the conditions that must be satisfied to apply Poisson probability distribution? 
What is the parameter of the Poisson distribution? 
9. Define the following terms: 
(a) Bernoulli trial 
(b) Binomial parameters 
(c) Binomial probability distribution 
(d) continuous random variable 
(e) discrete random variable 
(f) random variable 
(g) z-value or z-score 


10. Describe the difference between the probability distribution of a discrete random variable and that 
of a continuous random variable. 


11. Explain the main characteristics of a normal distribution. 
12. Describe the standard normal distribution curve. 
13. Describe the parameters of the normal distribution. 


14. Describe the conditions for the normal distribution to be used as an approximation to the binomial 


distribution. 
- STATE TRUEOR FALSE | 
1. A random variable is a quantitative variable whose value depends on chance. (True/False) 
2. A discrete random variable is a random variable whose possible values cannot be listed. 
(True/False) 
3. Probability distribution is a listing of the possible values and corresponding probabilities of a 
discrete random variable, or a formula for the probabilities. (True/False) 


4. Probability histogram 1s a graph of the probability distribution that displays the possible values of 
a discrete random variable on the horizontal axis and the probabilities of those values on the vertical 


axis. (True/False) 

5. For any discrete random variable, XP(X = 1) = 1. (True/False) 
The term expected value and expectation are commonly used in place of the term mean. 

(True/False) 

Wek K(k — 1) *** 2° 1. (True/False) 

1) (True/False) 

9. In Bernoulli trials, the trials are not independent. (True/False) 


10. In Bernoulli trials, the probability of a success, called the success probability, remains the same 
from trial to trial. (True/False) 
11. In Bernoulli trials, the experiment (each trial) has more than two possibilities. (True/False) 


12. The binomial distribution is the probability distribution for the number of failures in a sequence of 


Bernoulli trials. (True/False) 


13. In Bernoulli trials, the number of outcomes that contain exactly x successes equals the binomial 


n 
coefficient {"| : (True/False) 
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1, 


16. 


1 
18. 
19: 


20. 


pay 


22, 


23. 


24. 


23 


26. 


38. 


The mean of a binomial random variable with parameters n and p is ft = np. (True/False) 


The standard deviation of binomial random variable with parameters n and p is o = ynp(1— p). 


(True/False) 
The mean of a Poisson random variable with parameter A is p = A. (True/False) 
The standard deviation of a Poisson random variable with parameter A is 6 = VA. (True/False) 
A random variable is a quantitative variable whose value depends on chance. (True/False) 
A discrete random variable is a random variable whose possible values cannot be listed. 
(True/False) 
The sum of the probabilities of the possible values of a discrete random variable equals 0. 
(True/False) 
The number of possible permutations of m objects among themselves is m!. (True/False) 
The number of possible samples of size n from a population of size N is NC’. (True/False) 
For any two events, the probability that one or the other of the events occurs equal the sum of the 
two individual probabilities. (True/False) 
For any event, the probability that it occurs equals 1 minus the probability that it does not occur. 
(True/False) 
Data obtained by observing values of one variable of a population are called univariate data. 
(True/False) 
Data obtained by observing values of two variables of a population are called bivariate data. 
(True/False) 
A frequency distribution for bivariate data is called contingency table, or two-way table. 
(True/False) 
The joint probability equals the product of the marginal probabilities. (True/False) 


For a normally distributed variable, the percentage of all possible observations within any specified 
range equals the corresponding area under its associated normal curve, expressed as a percentage. 


(True/False) 
A normally distributed variable having mean 0 and standard deviation 1 is said to have the standard 
normal distribution. (True/False) 


A normal distribution is completely determined by the mean and standard deviation. (True/False) 
The shape of a normal distribution is completely determined by its standard deviation.(True/False) 


The total area under the standard normal curve is —1. (True/False) 
The standard normal curve extends indefinitely in both directions, approaching, but never touches, 
the horizontal axis as it does so. (True/False) 
The standard normal curve is symmetric about 0. (True/False) 
Almost all the area under the standard normal curve lies between —1 and +1. (True/False) 


For a normally distributed variable, one can determine the percentage of all possible observations 
that lie within any specified range by first converting z-scores and then obtaining the corresponding 
area under the standard normal curve. (True/False) 
The rule of thumb for using the normal approximation to the binomial is that np and n(1 — p) are 5 
are greater. (True/False) 
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39. A variable is said to be normally distributed if its distribution has the shape of a normal curve. 
(True/False) 
40. If a variable of a population is normally distributed and is the only variable under consideration. 
common practice is to say that the population is a normally distributed population. (True/False) 
41. The parameters for a normal curve are the corresponding mean and standard deviation of the 
variable. (True/False) 
42. Two variables that have the same mean and standard deviation have the same distribution. 
(True/False) 
43. Two normally distributed variables that have the same mean and standard deviations have the 
same distribution. (True/False) 
44. Two normal distributions that have the same mean are centred at the same place, regardless of the 
relationship between their standard deviations. (True/False) 
45. Two normal distributions that have the same standard deviations have the same shape, regardless 
of the relationship between their means. (True/False) 


ANSWERS TO STATE TRUE OR FALSE 


1. True 2. False 3. True 4. True 5.True 6.True 7. True 8.True 9. False 10. True 
11. False 12. False 13. True 14. True 15. True 16. True 17. True 18. True 19. True 20. False 
21. True 22. True 23. False 24. True 25. True 26. True 27. True 28. True 29. True 30. True 
31. True 32. True 33. False 34. True 35. True 36. False 37. True 38. True 39. True 40. True 
41. True 42. False 43. True 44. True 45. True 


TO 


CHAPTER 


Sampling Distributions 


This chapter introduces the reader the basic sampling distributions. Inferential statistics is used here to 
draw conclusions about a population, based on a probabilistic model of random samples of the population. 
Since different random samples will most likely give different estimates, some knowledge of the variability 
of all possible estimates derived from random samples is needed to arrive at reasonable conclusions. 
Population is any finite set of objects being investigated. A sample of objects drawn from a population is a 
random sample. 


If the populations contains N elements and a sample of n of them is to be selected, then if each of the 
N! 
(N —n)!n! 


as random sampling. Due to the difficulty in obtaining random samples in practice tables of random numbers 
are used. 


possible samples has an equal probability of being chosen, the procedure employed is known 


Sampling theory is the study of relationships existing between a population and samples drawn from 
the population. Sampling theory is useful in finding whether observed differences between two samples 
are actually due to chance variation or whether they are really significant. A study of inferences made 
regarding a population by the use of samples drawn from it, along with indications of the accuracy of such 
inferences using probability theory is known as statistical inferences. A statistic is a function of the 
observations in a random sample, which is not dependent on unknown parameters. A parameter is in general 


an unknown constant. For example, the parameters of a normal distribution are 4 and 6, whereas X and s? 
are statistics. The behaviour of sample statistics is needed in order to draw conclusions about a sample. 
The probabilistic distribution of a random variable defined on a space of random samples is called a sampling 
distribution. The behaviour of the sample statistics is described by a sampling distribution. Several 


sampling distributions are discussed in this chapter and their application to inferential statistics in Chapter 
5 (Estimation) and Chapter 6 (Hypothesis Testing). 


Suppose that y,, y,, ..., y, represents a sample. Then the sample mean 


jee (4.1) 
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and the sample variance 


Nn 
YO; -7Y 
ge! (4.2) 
= Ih 


are both statistics. These quantities are measures of the central tendency and dispersion of the sample. 


respectively. Sometimes s = Vs~ , called the sample standard deviation, is used as a measure of dispersion. 


4.1 PROPERTIES OF SAMPLE MEAN AND VARIANCE 


The sample mean y is a point estimator of the population mean p, and the sample variance s° is a point 


estimator of the population variance o°. An estimator of an unknown parameter is a statistic that corresponds 
to that parameter. Note that a point estimator is a random variable. A numerical value of an estimator computed 
from sample data, is called an estimate. There are several properties required of good point estimators. Two 
of the most important are the following: 


1. The point estimator must be unbiased. The long-run average of expected value of the point 
estimator should be the parameter that is being estimated. An unbiasedness is a desirable property 
and this alone does not always make an estimator a good one. 


2. An unbiased estimator must have minimum variance. The minimum variance point estimator has a 
variance that is smaller than the variance of any other estimator of that parameter. Here, we show 


that y and s° are unbiased estimators of and 0°, respectively. Using the properties of expectation, 


= = 1 n 1 nt 1 tt 
E(y) = E| =— -12($>,]=2 3200-2 5u=n (4.3) 
n Ray sae Ai 
since the expected value of each observation y, is p. Thus, y is an unbiased estimator of LL. 
Considering the sample variances s?. We get 
Dh : 
xe y 1 


> 1 
2, _ pp i=l = nT ed eas ST EAS (4.4) 
BO 175 ieee —E| 20% ma — (SS) 


where SS = Doak y,; — y) is the correct sum of squares of the observation y,. Now 


i=l i=l 


ESS) = e| Sox 5 |- e| Sy 19 


3 (uw? +.07)—n(u? +07 /n) =(n—-1o” (4.5) 


i=l 
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1 


n—-| 


Hence, E(S’)= E(SS) = 0? (4.6) 


Therefore, s? is an unbiased estimator of o?. The quantity n — 1 in Equation (4.5) is called the number of 
degrees of freedom of the sum of squares SS. If y is a random variable with variance o* and 


SS = Da yi- yy has degrees of freedom, then 


(=) =o? (4.7) 


v 


The number of degrees of freedom of a sum of squares is equal to the number of independent elements 


5 


in that sum of squares. For example, SS = ae Oi- y) in Equation (4.5) consists of the sum of squares 
a 


of the n elements y, - py, y, - y., .--» ¥, — y- These elements are not all independent since 


SS = eC —¥) = 0; in fact, only n — 1 of them are independent, implying that SS has n — 1 degrees 
of freedom. 


The standard error of a statistic is the standard deviation of its sampling distribution. If the standard 
error involves unknown parameters whose values can be estimated, substitution of these estimates into 
the standard error results in an estimated standard error. Suppose, we are sampling from a normal distribution 


with mean p and variance 6”. Now the distribution of X is normal with mean pu and variance 6?/n, so the 


standard error of X is 


(oy 
ae (4.7a) 
The estimated standard error of X is 
ae 
a5 (4.7b) 


4.2 POPULATION AND SAMPLING DISTRIBUTIONS —_ 


Here, we introduce the basic concept of population distribution and sampling distribution. 


4.2.1 Population Distribution 


The population distribution is the probability distribution of the population data. It is the probability 
distribution derived from the information on all elements of a population. 


4.2.2 Sampling Distribution 

A sample provides data for only a portion of an entire population and therefore we cannot expect the sample 
to yield perfectly accurate information about the population. The value of a population parameter is always 
constant. Any population data set has only one value of the population mean, .. We would expect different 
samples of the same size drawn from the same population to give different values of the sample mean, xX. 
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X , 1s therefore a random variable and it possesses a probability distribution. For each 
sample we can find a statistic, such as the mean, standard deviation, etc., which will vary from sample to 
sample. As a consequence, we obtain a distribution of the statistic which is called its sampling distribution. 
Sample statistics such as the mean, median, mode and standard deviation all possess sampling distributions. 
In general, the probability distribution of a sample statistic is called its sampling distribution. The probability 
distribution of x is called its sampling distribution. It lists the values that X can assume and the probability 
of each value of x. 


4.3 - SAMPLING AND NONSAMPLING ERRORS 


Since a sample provides data for only a portion of an entire population, we cannot expect the sample to 
give perfectly accurate information about the population. Hence, we can expect a certain information about 
the population. Hence, we can expect a certain amount of error called sampling error - will result simply 
because we are sampling. Sampling error is the difference between the value of a sample statistic and the 
value of the corresponding population parameter. In the case of the mean, 


The sample mean, 


Sampling error = x -—p (4.8) 
assuming that the sample is random and no nonsampling error has been made. 


A sampling error occurs due to chance. The errors that occur in the collection, recording and tabulation 
of data are called nonsampling errors. Such errors occur due to human mistakes and not chance. The larger 
the sample size, the smaller the sampling error tends to be in estimating a population mean, LL. by a sample 


mean X. 


Example E4.1 


The mean ages of all students in a large university follow a distribution that is skewed to the right is 24 
years and a standard deviation of 4 years. Find the probability that the mean age for a random sample of 36 
students would be 


(a) between 23 and 25 years 
(b) less than 23 years. 


SOLUTION: 
Given the population mean = 24 years, and n = 36. 


The standard deviation of the sample mean is 


oO 4 


GO. =—= = ——= = 0.6667 years 
Ea | 4136 
304 
,: S = = —1.50 
(a) For ao AZ 0.6667 
524) 
,= * = = 1.50 
BOO = 29: 2 1 6667 


P(23.< ® <25)=P(-1.50 < z < 1.50) = (0.933193 — 0.066807) = 0.86639 
(from the table in Appendix-E) 
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(b) For HO 


P(X <25) = P(z< -1.50) = 0.066807 (from the table in Appendix-E). 


4.4 - MEAN AND STANDARD DEVIATION OF x 


For a variable x and a given sample size, the distribution of the variable X is called the sampling distribution 
of the sample mean. 


The mean and standard deviation of the sampling distribution of X are called the mean and standard 


deviation of X and are denoted by uw; and oO, respectively. 


The standard deviation of X is also called the standard error of x. There is a simple relationship 
between the mean of the variable ¥ and the mean of the variable under consideration. They are equal, or 
uw, =. Thus, for any particular sample size, the mean of all possible samples means equals the populations 
mean. This equality holds true regardless of the size of the sample. 

Hence, the mean of the sampling distribution of X is always equal to the mean of the population. 

ey (4.9) 

The sample mean x, is called an estimate of the population mean, uw. When the expected value (or 

mean) of a sample statistic is equal to the value of the corresponding population parameter, that simple 


Statistic is said to be an unbiased estimator. For the sample mean x , U, =u. Therefore, x is an unbiased 


estimator of |. 


Standard Deviation of the Sample Mean 
For samples of size n, the standard deviation of the variable x equals the standard deviation of the variable 
under consideration divided by the square root of the sample size. 

Oo 


ne (4.10) 


The above Eq. (4.10) assumes the sampling is done with replacement from a finite population or when 
it is done from an infinite population. When sampling is done without replacement from a finite population, 
the appropriate formula is 


Hence, O; = 


ae Nan oO. 
¥ N-1 Jn (4.11) 


where n denotes the sample size and N the population size. 


The sample size is considered to be small compared to the population size if the sample is equal to or 
less than 5% of the population size. 


That is, if n/N < 0.05. 
When the sample size is small relative to the population size, there is little difference between sampling 


with and without replacement. In such cases, the two formulae Eqs. (4.10) and (4.11) for 6, yield almost 
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the same values. However, in most practical applications, the sample size is small relative to the population 
size. 


The possible sample means cluster more closely around the population mean as the sample size 
increases, and therefore the larger the sample size, the smaller the sampling error tends to be in estimating a 
population mean by a sample mean. 


The spread of the sampling distribution of X¥ is smaller than the spread of the corresponding population 


distribution. In other words, o, <6. The standard deviation of the sampling distribution of ¥ decreases as 


the sample size increases. 


If the standard deviation of a sample statistic decreases as the sample size is increased, that statistic is 


S : F OnmE : , FE 
said to be a consistent estimator. From Fq.(4.10), 6, =—=., it is clear that as n increases, the value of Rat 


Vn 


' oO =o , 
also increases and as a result, the value of — decreases. Hence, the sample mean XY is a consistent 


vn 


estimator of the population mean, wU. 


Example E4.2 


A population random variable X has mean 120 and standard deviation 15. Find the mean and standard 


deviation of the sample mean X for random samples of size 5 drawn with replacement. 


SOLUTION: 
For the population, pt = 120, o = 15. 


The mean pz and the standard deviation Oz of X are given by 


and : Oe ls 


Example E4.3 


A population random variable X has mean 120 and standard deviation 15. Find the mean and standard 


deviation of X if the population size is 300 and the samples of size 5 are drawn without replacement. 


SOLUTION: 
Here N= 300 andn=5 
Mean Wy =H =120,0=15. 


and the standard deviation 


a = 13) (500 =) 


Oy = ya ~ 5 V 300-1 


= 6.6632 
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Example E4.4 


Let 5 = {1, 3, 7, 9}. Find the probability distribution of the sample mean x for random samples of size 2 
drawn with replacement. 


SOLUTION: 


Since S has 4 elements, there are 42 = 16 random samples of size 2 drawn with replacement. These pairs and 
their average values are given in Table P4.4. 


Table P4.4 


The probability distribution of X is given in Table P4.4(a) 
Table P4.4(a) 


Example E4.5 
Pet Sell, 37,9) 


(a) list all samples of size 3, without replacement 


(b) how many samples, without replacement, are there of size 4, size n? 
SOLUTION: 


(a) A sample of size 3, without replacement is a subset of S containing 3 elements. There are 


[)-4 


Subsets: (1; 3;7).41,3, 9}, 41 7, 9), (3,720 


4 
(b) Forn= 1, 2, 3, 4, there are (" samples of size n; for n > 4, there are no samples of size n. 


Example E4.6 


Caen nr re SS 


Let S = {1, 3, 7, 9}. Find the probability distribution of the sample mean X for random samples of size 2 
drawn without replacement. 


/! Sampling Distributions // 181 


SOLUTION: 
Si ‘ 
ince S has 4 elements, there are ce 6 random samples of size 2 drawn without replacement. These. their 


average value, and the probability distribution of ¥ are given in Tables P4.6 and P4.6(a). 
Table P4.6 


Table P4.6(a) 


Example E4.7 


How many teams of 5 students can be randomly selected from a class of 10 female students and 15 male 
students? (a) how many teams will have all male students? (b) how many teams will have all female 
students? (c) how many teams will have 3 female students and 2 male students? 


SOLUTION: 


The number of 5-student teams is the number of ways at 5 students can be selected from a class of 25 
students, or the number of samples of size 5 that can be selected, without replacement, from a population 
of size 25, which is 


25 
5 ] = 53, 130 (from the table in Appendix-B) 
(a) The number of teams that will have all male students is 
15 . ; 
Pyle 3003 (from the table in Appendix-B) 
(b) The number of teams that have all female students is 
10 ; : 
Pic 252 (from the table in Appendix-B) 
(c) The number of teams that have 3 female students and 2 male students is 


3] be = (120)(150) = 12,600 (see Appendix-B). 


Determine the most likely breakdown of male students and female students in a team of 5 randomly selected 
from 15 male and 10 female students. 
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SOLUTION: 
The ratio of 15 male students to 10 female students is 3 to 2. Hence, a team of 3 male and 2 female students 
would occur at random. From Problem P4.7, we have 

5 male students in a team = 3003 

5 female students in a team 202 

3 female students and 2 male students in a team = 12600 


In a similar manner, we obtain the following counts: (use the table in Appendix-B) 
Is 10 
1 male and 4 female students in a team = Tee || = (15)(210) = 3150 


pls 


10 
3 male and 2 female students in a team = | 3 il > = (455)(45) = 20,475 


15)\( 10 
i | = 1365) 10)'= 13:650 


4 male and 1 female students in a team = 4 


4.5 . SHAPE OF THE SAMPLING DISTRIBUTION OF <x 


In Section 4.4, we described the sampling distribution of the sample mean, that is, the distribution of the 
variable x . It was shown there that the mean and standard deviation of x can be expressed in terms of the 
sample size and the population mean and standard deviation: 


u;=p and o au 
x x Vn 


n 

In this section, we describe the shape of the sampling distribution of x as related to the following two 
cases: 

1. The population from which samples are drawn has a normal distribution. 


2. The population from which samples are drawn does not have a normal distribution. 
4.5.1 Sampling from a Normally Distributed Population 


If the population variable x of a population is normally distributed with mean p and standard deviation o, 


then the sampling distribution of the sample mean, X , will also be normally distributed with the mean ui and 


standard deviation o/Vn 3 


Thatis bb, =p 
eo} n 
and Oz = Ta W < 0.05 (4.12) 


From the sampling distribution of x for different sample sizes, n, it can be observed that the spread of 


the sampling distribution of X decreases as the sample size increases. 
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Example E4.9 


The lengths of all machine parts made by a company have a distribution that is skewed to the right with a 
mean of 68 mm and a standard deviation of 4 mm. Find the probability that the mean length of a random 
sample of 100 parts produced by this company would be 


(a) less than 67.8 mm 
(b) between 67.5 mm and 68.7 mm 
(c) within 0.6 mm of the population mean 


(d) lower than the population mean by 0.5 mm or more. 
SOLUTION: 
Given [W=68 mm,o=4 mm andn= 100. 


The standard deviation of the sample mean 


oO 4 


x = Sss= = (44 vin 
| An 4100 


ie} 


67.8 -—68 
(a) For X =67.8mm: ernie: = a. 


P(x <67.8) = P(z <-0.50) = 0.308538 (from the table in Appendix-E). 


67.5 —68 
(b) Hot — 67 .: ae ws 


68.7 — 68 
For xX =68.7: z= ————- = 1.75 
0.4 


P(67.5< X $67.8) =P(-1.25< z¥ 1.75) = (0.105650 + 0.959941) = 0.85429 
(from the table in Appendix-£). 
(c) P(X within 0.6 mm of w) = P(67.4< x < 68.6) 


67.4—68 


Cao : Z=———_ = - 1.50 
ror x =6/47-2 An 
68.6 — 68 

For X =68.6: z <a Toe = Joi!) 

P(674 5 X $68.6) = P(-1.50 <z < 1.50) = 0.933193 - 0.066807 = 0.86639 
(from the table in Appendix-£). 

(d) P(X lower than p by 0.5 mm or more) = p(x $67.5) 
P(X $67.5) = P(z $-1.25) = 0.10565 (from the table in Appendix-£). 
67.5 — 68 
For X =67.5: z=———_ = 1.25 


0.4 
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4.5.2 Sampling from a Population that is not Normally Distributed 
If the sampling is done from a population that is not normally distributed, then the shape of the sampling 


distribution of ¥ is inferred from central limit theorem (see Chapter 3, Section 3.8). 


4.6 - APPLICATIONS OF THE SAMPLING DISTRIBUTION OF <x 


It was shown in Chapter 3 (Section 3.8) on central limit theorem that for large samples, the sampling distribution 
of ¥ is approximately normal, regardless of the distribution of the variable under considerations. The 
approximation becomes better with increasing sample size. In general, the farther the variable under consideration 
is from being normally distributed, the larger the sample size must be for a normal distribution to provide an 


adequate approximation to the distribution of ¥. A sample size of 30 or more (n 2 30) is large enough. 
The sampling distribution of the sample mean can be summarised as follows: 
If the variable x of a population has mean uw and standard deviation o, then for samples of size n: 


1. The mean of X equals the population mean, or Lz = LL. 


2. The standard deviation of ¥ equals the population standard deviation divided by the square root 


: 0 
of the sample size, or o; = —=. 


Vn 


3. Ifx is normally distributed, so is x, regardless of sample size. 


4. Ifthe sample size is large, ¥ is approximately normally distributed, regardless of the distribution 
of x. 


Example E4.10 


Refer to Problem P4.2. Suppose the random variable X in Problem P4.2 is approximately normally distributed, 


determine P(115 < X < 125) for samples of size 5 drawn with replacement. 
SOLUTION: 


From the solution of P4.2, the mean and standard deviation of X are 1, = 120 and G, = 6.7082. Kas 
approximately normally distributed. Hence, 


P(IS< X <125)= P| HS i ee) 


6.7082 6.7082 ~—s- 6.7082 


= P(-0.745 < z< 0.745) (See the table in Appendix-£). 
Where z is the standard normal random variable. 


= (0.229650 + 0.770350) = 0.5407 


Example E4.11 
Se 


Refer to Problem P4.3. Suppose the random variable X in Problem P4.3 is approximately normally distributed, 


find P(115 < X < 125) for samples of size 5 drawn with replacement, 
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SOLUTION: 


Refer to the solution of Problem P4.3. The mean and standard deviation of X are Uy = 120and of = 616622. 


X is approximately normally distributed. Hence, 


= 5 = 120. ¥ =190 125-1201 
Pamnbe  <2sye P| 15— 120 X—120 - 125-120 = : ahs 
666AD) BIGE4D «GRE = P(-0.75039 < z < 0.75039) 


where z is the standard normal random variable. 
Using the standard normal table in Appendix-E, we have 
P(-0.75039 < z < 0.75039) = (-0.226627 + 0.773373) = 0.54675 


4.7 POPULATION AND SAMPLE PROPORTIONS 


The population proportion, denoted by p, is obtained by taking the ratio of the number of elements in a 
population with a specific characteristic to the total number of elements in the population. In general, when 
we pick a sample, 


number of items in the sample having the attribute 


Sample proportion of an attribute = (4.13) 


sample size 


Hence, if X is the number of items having a certain attribute in a sample of size n. then the sample 
proportion having the attribute is the random variable X/n. The probability distribution of this statistic is 
called the sampling distribution of the proportion. 


The population and sample proportions, denoted by p and p. respectively. are calculated as 


xX oe 
Daa and oe (4.14) 
where WN = total number of elements in the population 
n = total number of elements in the sample 
X = number of elements in the population that possess a specific characteristic 
x = number of elements in the sample that possess a specific characteristic 


The sampled population can be described by the following probability distribution: 


Probability 
1-p 


The computations for finding the mean and the variance of this population are shown in Table 4.1. 


Table 4.1 


Probability 
p(x) 


ee 
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From column 3 in Table 4.1, Xxp(x) = p. 

Hence, the population mean [J is p or UW = p . (4.15) 
Similarly the sum in column 4 is Xx*p(x) = p. 

Hence, the variance of the population, 6” is given by 


O° = Lx’p(x) — = p — p’= p( —p) 
Therefore, population standard deviation = + p(1— p) (4.16) 


‘SAMPLING DISTRIBUTIONOFi #  } — 


4.8 


Similar to the sample mean X , the sample proportion, /, is also a random variable. It possesses a probability 


distribution called its sampling distribution. It gives the various values that p can assume and their 


probabilities. 


MEAN AND STANDARD DEVIATION OF ? 


4.9 


The mean of p, which is the same as the mean of the sampling distribution p, is always equal to the 
population proportion, p. 


Hence, H,=p (4.17) 
The sample proportion, ?, is called an estimator of the population proportion, p. When the expected 
value (or mean) of a sample statistic is equal to the value of the corresponding population parameter, that 
sample statistic is said to be an unbiased estimator. Since for the sample population, 4; =p, pis an unbiased 


estimator of p. The standard deviation of p, denoted by ©,, is given by 


__ [pd=p) _ ie 
0; ; F | (4.18) 


where p is the population proportion, g = 1 — p, and n is the sample size. 


For large values of n (n 2 30) the sampling distribution is very closely normally distributed. Note that 
the population is binomially distributed. The Eq. (4.18) is also valid for a finite population in which sampling 
is with replacement. 


However, if n/N = 0.05, then 0 p is given by 


_ |pe > Nan 
05 =e = vey (4.19) 


N=n , ae : P 
is called the finite population correction factor. 


where the factor 


In general, the sample size, n is small compared to the population size, N and Eq.(4.18) is used. 
If the standard deviation of a sample statistic decreases as the sample size is increased, that statistic is 


called the consistent estimator. It is clear from Eq.(4.18) that as n increases, the value of J pqin decreases 


and the sample proportion, p, is said to be the consistent estimator of the population proportion, p. 
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The shape of the sampling distribution of p is inferred from the central limit theorem described in 
Section 3.8 of Chapter 3. According to the central limit theorem, the sampling distribution of p is 
approximately normal for a sufficiently large sample size. In the case of a proportions, the sample size is 
considered to be sufficiently large if np and n(1 — p) are both greater than 5. That is. if np > 5 and n(1 ~ p) > 
5. This condition is the same as required for the application of the normal approximation to the binomial 
probability distribution described in Chapter 3 (Section 3.5). 


Example E4.12 


A company manufactures engine cylinders. The machine that is used to make these cylinders is known to 
produce 6% detective cylinders. If a sample of 100 cylinders are selected every week and inspected them 
for being good or defective. If 8% or more of the cylinders in the sample are defective. the process is 
stopped and the machine is readjusted. Determine the probability that based on a sample of 100 cylinders 
the process will be stopped to readjust the machine. 


SOLUTION: 
Here, p=0.06, g=1-—p=1-—0.06 = 0.94 and n= 100. 


m Se - | - — 
n n 100 


(0.08 — 0.06) =0.84 
0.023749 


Therefore, P( p = 0.08) = P(z=0.84) = P(1 —z< 0.84). Refer to the table in Appendix-E. 
P( p = 0.08) = (1 — 0.799546) = 0.20045 


For p = 0.08, we have 


ie 
4 


Example E4.13 


Let m be the mean annual salary of faculty members in a college for 1995. Assume that the standard deviation 
of the salaries of these faculty members is $50,000. What is the probability that the 1995 mean salary of a 


i. 
random sample of 100 faculty members was within $10,000 of the population mean, #1? Assume that rs £0.05. 


SOLUTION: 
Given 6 = $50,000 and n = 100 
Standard deviation of the sample mean 


S _ WN = $5000 


tn Si00 
The required probability is 
P{— 10,000 < x <p+ 10,000) 
For ¥ = p— 10,000: z = [(u — 10,000) — pJ/5,000 = -2 
For X =u+ 10,000: z= [(u + 10,000) — /5,000 = 2 
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Hence P(u— 10,000 < x <u+ 10,000) = P-2<zS2) 
= (-0.022750 + 0.977250) from the table in Appendix-E 
= 0.9545 


Example E4.14 


A particular city is planning to build a nuclear power plant to generate the electric power. An independent 
survey fund that 53% of the voters in that city favour the building of that plant. Assume that this result 
holds true for the population of all the voters in this city. 


(a) what is the probability that more than 50% of the voters in a random sample of 200 voters selected 
from this city will favour the building of this plant? 

(b) acity official would like to take a random sample of voters in which over 50% would favour the 
plant building. How large a sample should be selected so that the city official is 95% sure of this 


n 
outcome? Assume W < 0.05. 


SOLUTION: 
Given p=0.53,n = 200 and n/N =< 0.05 
(a) Standard deviation of the sample mean 


ae [p= p) : {(0.53)(0.47) ~ 0.03529 
n 200 


The shape of the sampling distribution is approximately normal. In order to have over 50% in favour in a 
sample of 200 requires 101 or more in favour of the plant building. Hence, we require 


101 
P| p>—| = P(p>0.505 
[p oa (p ) 


b-p _ 0505-0. 
: en 


So, 0.03529 


P( p 20.505) = P(z 2 -0.71) = 1 — P(z S$ -0.71) = 1 — 0.238852 = 0.76115 


(from the table in Appendix-E). 
(b) P(z> -1.65) = 1 — P(z <—1.65) = | — 0.049471 = 0.95053 (from the table in Appendix-E). 


Since Z= 


ji 2 . 05—053 
re 165 


Since o,= [p= p) 
n 


p(l-p) _ 0.53(0.47) 
(o,) (0.01818)° 


= = 0.01818 


n= 


= 753.53 = 754 


Hence, the sample should include at least 754 voters. 
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Example E4.15 


A euDple of 49 is picked at random from a population of manufactured steel circular rods. If the standard 
deviation of the distribution of their diameters is known to be 3 cm, find the standard error of the mean if 
(a) the population consists of 1000 steel rods 
(b) the population is extremely large 


SOLUTION: 


(a) We are giving that n = 49, N = 1000, and o = 3. Hence, the standard error of the mean is 


o [N- 3 /1000-49 
Be = = (0.97568) = 0.418 


vnVN-1 49 Y 1000-1 


(b) If the population size is extremely large (practically infinite), the standard error is given by 


ee 2 L786) 


vn J49 7 


4.10 THE CHI-SQUARE DISTRIBUTION 


The chi-square distribution is sometimes written as x° distribution and read as chi-square distribution. 
The values of a chi-square distribution are denoted by the symbol x°*, just as the values of the standard 
normal distribution and the ¢-distribution are defined by z and ¢, respectively. A variable has a chi-square 
distribution if its distribution has the shape of a specific type of right-skewed curve. called a chi-square 
(x?) curve. There are infinitely many chi-square distributions. The chi-square curve is identified by its number 
of degrees of freedom. 

Let Z,, Z,, ..., Z, be normally and independently distributed random variables, with mean p = 0 and 
variance 6” = 1. Then the random variable 


( =Z4+Z5 ++Z, 
has the probability density function 


Oe 1 ykiDo-ul2—w>0 
iB ee 


26 otherwise (4.20) 


and is said to follow the chi-square distribution with k degrees of freedom abbreviated Rigs 


The mean and variance of the re distribution are 
(4.21) 


(4.22) 


=k 
and Oo = 2k 


Several chi-square distributions are shown in Fig. 4.1. 
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. . 2 
Some basic properties of X°-curves are: 


1. The total area under a X’-curve equals 1. 

2. A X-curve starts at 0 on the horizontal axis and 
extends indefinitely to the right, approaching, but 
never touching, the horizontal axis as it does so. 

3. AX -curve is right-skewed. 

As the number of degrees of freedom becomes 
larger, X-curves look increasingly like normal curves. 

Percentages (and probabilities) for a variable having a 
chi-square distribution are equal to areas under its associated 
X’-curve. 

The chi-square random variable is non-negative, and the 
probability distribution is skewed to the right. As & increases, 
the distribution becomes more symmetric. As k — o, the 
limiting form of the chi-square distribution is the normal 


0 5 10 15 20 25 


distribution. The percentage points of the ie distribution are 
A Fig. 4.1: Several X’ distributions 
given in Appendix-F. Define Xa,« as the percentage point or 


value of the chi-square random variable with k degrees of freedom such that the probability that ie exceed 
this value is a. That 


Pi xg Kan} = j fa (w)du=o (4.23) 


Xak 


The probability in Eq. (4.23) is shown in Fig. 4.2. 


oe xe 


Fig. 4.2: Percentage point Gy of the chi-square distribution 
The two outside columns of table in Appendix-F, labeled df, display the number of degrees of freedom. 
The symbol X. denotes the X’-value having area ©: to its right under a X’-value. Hence, the column headed 


2 c : 
Xo.095, for example, contains %’-values having area 0.995 to their right. 


Example E4.16 
ce 


Determine the X’-value having area 0.05 to the left for a X°-curve with df = 9 as shown in Fig. E4.16(«). 


0.05 


Fig. E4.16 


SOLUTION: 


The total area under a X’-curve equals | — 0.05 = 0.95. Hence, the required X°-value is ‘i yys . From the table 


2s 


in Appendix-F with df= 9, X.,,, = 3.325. Therefore, for a X°-curve with df= 9, the X-values having area 0.05 
to its left is 3.325 as shown in Fig. E4.16(d). 


Example E4.17 


For a X°-curve with df = 25, determine the two X-values that divide the area under the curve into a middle 
0.95 area and two outside 0.025 areas as shown in Fig. E4.17(a). 


0.05 


Fig. E4.17 


SOLUTION: 
First, we find the X?-value on the right in Fig. E4.17(a). Because the shaded area on the right is 0.025, the 


%°-value on the right is X05. From the table in Appendix-F with df= 25, X0.o2s = 40.647. 
Next, we find the X°-value on the left in Fig. E4.17(a). Because the area to the left of that Z-value is 


: a : ; 
(0.025, the area to its right is | — 0.025 = 0.975. Hence, the X°-value on the left is X,,.. which. by table in 
Appendix-F equals 13.120 for df= 25. 
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Consequently, for a xX--curve with df = 25, the two x°-values that the area under the curve into a middle 
0.95 area and two outside 0.025 areas are 13.120 and 40.167, as shown in Fig. E4.17(0). 


4.11 THEt-DISTRIBUTION 


The standardised version of ¥ has the standard normal distribution. William Gosset in 1908 determined the 


distribution of the studentised version of ¥ , a distribution now called student's t-distribution or, simply, 
the t-distribution. 

There is a different r-distribution for each sample size. A particular r-distribution is identified by its 
number of degrees of freedom (df). For the studentised version of x , the number of degrees of freedom is 


1 less than the sample size, which is indicated symbolically by df =n — 1. The mean of the ¢-distribution is 0 
just like the standard normal distribution. But unlike the standard normal distribution, whose standard 


deviation is |, the standard deviation of a r-distribution is - > which is always greater than 1. Hence, 
di 


the standard deviation of a 1-distribution is larger than the standard deviation of the standard normal 
distribution. 

The number of degree of freedom is the only parameter of the ¢-distribution. A variable with a 
t-distribution has an associated curve, called a t-curve. There is a different f-distribution for each number of 
degrees of freedom. 

The f-distribution is a specific type bell-shaped distribution with a height and a wider spread than the 
standard normal distribution. As the sample size becomes larger, the ¢-distribution approaches the standard 
normal distribution. 


Let Z ~ N(O, 1) and V be a chi-square random variable with k degrees of freedom. If Z and v are 
independent, then the random variable 


ZL 


Wik (4.24) 


has the probability density function 


y= +01, 1 
Vk T(k/2) ((t2 1k) +1]? et (4.25) 


and is said to follow the t-distribution with k degrees of freedom, abbreviated iS 


The mean and variance of f are u = 0 and 6° = k/(k — 2) for k > 2, respectively. Several t-distributions 
are shown in Fig. 4.3. 
The basic properties of t-curves are: 
1. The total area under a t-curve equals 1. 
2. A t-curve extends indefinitely in both directions, approaching, but never touching, the horizontal 
axis as it does so. 
3. A t-curve is symmetric about 0. 


As the number of degrees of freedom becomes larger, t-curves look increasingly like the standard 
normal curve. 
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Note that if k =o, the -distribution becomes the standard normal distribution. A table of percentage 
points of the r-distribution is given in Appendix-G. If X).N,. +...¥, is a random sample from N(u, 6°) distribution, 
then the quantity 


xy 


 — 
siVn 


is distributed as t with n — 1 degrees of freedom. 


We will let ¢,, be the value of the random variable I with k degrees of freedom above which we find an 
area (or probability) @. Thus ¢,, is an upper-tail 100a percentage point of the t-distribution with & degrees 
of freedom. This percentage point is shown in Fig. 4.4. In the Appendix-G, the @ values are the column 
headings, and the degrees of freedom are listed in the left column. To illustrate the use of the table. note 
that the r-value with 15 degrees of freedom having an area of 0.05 to the right is f = |. 758eTibaters: 


0.05.10 
eu > tf Ve (58) 005 


10 OOS 15 10 


Since f-distribution is symmetric about zero, we have rf, = -f,; that is. the -value having an area of 


l-us 


1 — a to the right (and therefore an area of & to the left) is equal to the negative of the r-value that has area 
a in the right tail of the distribution. Therefore, ¢, 


0.9515 


t —1.753. 


0.05.15 — 


k = oo (normal) 


0 


Fig. 4.3: Several t-distributions 


tiok= —tok 0 to,k 


Fig. 4.4: Percentage points of the f-distribution 


Example E4.18 
Find a -curve with 17 degrees of freedom, determine ,.,,;. In other words, find the value having area 0.05 


to its right, as shown in Fig. F4.18(a). 
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Area = 0.05 


t= 1.740 


Fig. E4.18 


SOLUTION: 


The number degrees of freedom is 17, so we first go down the outside columns, (from the table in 
Appendix-E) labeled df. to “17”. Then, going across that row to the column labeled f,,., we reach 1.740. 
This number is the t-value having area 0.05 to its right, as shown in Fig. E4.18(5). In other words, for a 
t-curve with df = 17, t,,. = 1.740. 


0.05 


4.12 THE F-DISTRIBUTION 


A variable is said to have an F-distribution if its distribution has the shape of a special type of right- 
skewed curve, called an F-curve. There are infinitely many F-distributions. The shape of a particular 
F-distribution curve depends on the number of degrees of freedom. There are two numbers of degrees of 
freedom for the F-distribution curve. The first number of degrees of freedom for an F-curve is called the 
degrees of freedom for the numerator and the second the degrees of freedom for the denominator. 

The random variable F is defined to be the ratio of two independent chi-square random variables, each 
divided by its number of degrees freedom. That is 


_Wlu 
Y/v 


where W and Y are independent chi-square random variables with u and v degrees of freedom, respectively. 


(4.26) 


Let W and Y be independent chi-square random variables with u and v degrees of freedom, respectively. 
Then the ratio 


W/u 
he 
Y/v (4.27) 


has the probability density function 
u/l2 
ieveleas 
2 v 


JOP 


and is said to follow the F-distribution with u degrees of freedom in the numerator and v degrees of freedom 
in the denominator. It is usually abbreviated as F 


f(x)= O0<x<0o (4.28) 


The mean and variance of F-distribution are 4 = v/(v — 2) for v > 2, and 
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2 2v? (u+v—2) 
u(v — 2)" (v—4) 


v>4 


The F tandom variable is non-negative, and the distribution is skewed to the right. The F-distribution 
looks very similar to the chi-square distribution as shown in Fig. 4.5; however the two parameters w and \' 
provide extra flexibility regarding shape. 


f(x) 


Fig. 4.5: Three F-distribution curves 


The basic properties of F-curves are: 
1. The total area under an F-curve equals 1. 
2. An F-curve starts on O on the horizontal axis and extends indefinitely to the right, approaching, 
but never touching, the horizontal axis as it does so. 
3. An F-curve is right-skewed. 
For an F-curve with df = (u, v), the F-value having area © to its left equals the reciprocal of the /-value 
having area © to its right for an F-curve with df= (v, u). 


The percentage points of F distribution are given in Appendix-H. Let /, be the percentage point of 
F-distribution, with the numerator degrees of freedom uw and the denominator degrees of freedom vy such 
that the probability that the random variable F exceeds this value is 


Ee) ae (2) doo. (4.29) 
herr 
This is illustrated in Fig. 4.6. 
f(x) 


0 if tins 


1-a.,.UY¥ 


Fig. 4.6: Upper and lower percentage points of the F-distribution 
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For example, if u = 6 and v = 10, we find from the table in Appendix-H, that 
PCF > fogs sig) = Ps 19 > 3-22) = 0.05 


5,10 
That is, the upper 5 percentage point of F,,, 18 fyyss 49 = 3-22: The table in Appendix-H contains only 
upper-tail percentage points (for selected values of f,,, for @ S$ 0.25) of the F-distribution. The lower-tail 


percentage point f, , ,, can be found as follows: 


l (4.30) 
f lew = je 
tenes 
For example, to find the lower-tail percentage point of f,,., ,,, note that 
1 i 
fo9s,610 = =—— = 7a = 0.246 


Fo.05.10.6 4.06 


Example E4.19 


For an F-curve with df(5. 13), find F,,,.. That is, find an F-value having area 0.05 to its right as shown in 
Fig. E4.19(a). 


F-curve F-curve 


at = (5,13) ai = (5,13) 


Area = 0.05 Area = 0.05 


Fig. £4.19 


SOLUTION: 


To find the F-value, we use the table in Appendix-H. In this case, & = 0.05 (area in the right tail under the 
F-distribution curve). The degrees of freedom for the numerator is 5, and the degrees of freedom for the 
denominator is 13. 


We first go down the degree of freedom to “13”. Next, we concentrate on the row for a labeled 0.05. 
Then, going across that row to the column labeled “5” (from the table in Appendix-E), we reach 3.03. This 
number is the F-value having area 0.05 to its right, as shown in Fig. E4.19(b). In other words, for an F-curve 
with df(5, 15), F,,, = 3.03. 


Example E4.20 
For an F-curve with df = (50, 8) having an area 0.05 to its left. 


SOLUTION: 


The required F-value is the reciprocal of the F-value having area 0.05 to its right for an f-curve with 
df = (8, 50). From the table in Appendix-H, this latter F-value equals 2.13. Consequently, the required 


1 
F-value is 7B or 0.4695, as shown in Fig. E4.20. 
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F-curve 


af = (50, 8) 


0.05 


wl 
543 = 0.4695 


Fig. £4.20 


Example E4.21 


For an F-curve with df = (10, 8), determine the two F-values that divide the area under the curve into a 
middle 0.95 area and two outside 0.025 areas as shown in Fig. E4.21(a). 


SOLUTION: 


First, we find the F-value on the right in Fig. E4.21(a). Because, the shaded area on the right is 0.025, the 
F-value on the right is F, ,.. From the table in Appendix-H, with df= (10, 8), F,.,. = 4.30. 


0.025° ),025 
Next, we find the F-value on the left in Fig. E4.21(a). The F-value is the reciprocal of the F-value having 
area 0.025 to its right for an F-curve with df(10, 8). From the table in Appendix-H. we find that this latter 


1 
-val [So O26) 
F-value equals 4.30 


Consequently, for an F-curve with df(10, 8), the two F-values that divide the area under the curve into a 
middle 0.95 area and two outside 0.025 areas are 0.2326 and 4.30, as shown in Fig. E4.21(5). 


Fig. £4.21 


In Chapter 3, we discussed the probability distributions of discrete and continuous random variables. In 
this chapter, we extended the concept of probability distribution to that of a sample statistic. A sample 
statistic is a numerical summary measure calculated for sample data. The numerical summary measures 
calculated for population data are called population parameters. A population parameter 1s always a bonnet. 
whereas a sample statistic is always a random variable. Also, since every random variable must possess a 
probability distribution, each sample statistic possesses a probability distribution. The probability ina ah 
of a sample statistic is more generally called its sampling distribution. In this chapter. we pannasiad - 
sampling distributions of the sample mean and the sample proportion. The concepts presented in this chapter 
forms the foundation for the inferential statistics discussed in Chapters 5 and 6. 
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PROBLEMS 


P4.1 


P4.2 


P4.3 


P4.4 


P4.5 


P4.6 


P4.7 


P4.8 


P4.9 


A population consists of the values 1, 2 and 5. 

(a) list all the possible samples (with replacement) of size n = 2 along with the sample means and 
their individual probabilities. 

(b) find the mean of the sampling distribution 

(c) does the sample mean target the values of the population mean? 

A manufacturing company produces machine parts that have a mean tensile strength of 100 MPa 


and a standard deviation of 10 MPa. The distribution of tensile strength is normal. Find the 
probability that a tensile strength less than 95 MPa. 


Suppose that a random variable X has a continuous uniform distribution 


M/s WES ye S (6) 
f(xy= 


0. otherwise 


Find the distribution of the sample mean of a random sample of size n = 50. 


Assume that the weights of 4000 male students at a university are normally distributed with a 
mean of 60 kg and standard deviation 3.0 kg. If 100 samples consisting 25 students each are obtained, 
what would be the expected mean and standard deviation of the resulting sampling distribution of 
means if sampling were done (a) with replacement, (b) without replacement? 


A survey showed that the mean expenditure incurred by a student in 1995 was $9000 and the 
standard deviation of the expenditure was $800. Find the approximate probability that the mean 
expenditure of 64 students picked at random was 


(a) more than $8820 
(b) between $8800 and $9120 


Refer to Problem P4.4. In how many samples of Problem P4. 4 would you expect to find the mean 
(a) between 58.8 and 60.3 kg 

(b) less than 58.4 kg 

The length of life (in hours) of a certain type of machine component is a random variable with a 


mean of 600 hours and a standard deviation of 70 hours. What is the approximate probability that 
a random sample of 196 machine parts will have a mean life between 588 and 605 hours? 


Five hundred electronic components have a mean weight of 6.02 g and a standard deviation of 
0.3 g. Find the probability that a random sample of 100 electronic components chosen from this 
group will have a combined weight of 

(a) between 596 and 600 g 


(b) more than 610 g 
Assume that the weights of machine parts of a heavy machine are normally distributed with mean 


55 kg and standard deviation 2 kg. A number of samples of 100 parts each are taken at random with 
replacement from the population. Determine 
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P4.10 


P4.11 


P4.12 


P4.13 


P4.14 


P4.15 


P4.16 


(a) the mean and standard deviation of the sampling distribution of the mean 

(b) the probability that the sample mean will differ from the population mean by less than 0.4 kg. 
It was found in a particular survey that adults spend an average of 10 hours a day at work and 
commuting. Let the daily work and commute times for all adults have a mean of 10 hours and a 


standard deviation of 2 hours. Find the probability that the mean of the daily work and commute 
times for a random sample of 100 adults will be 


(a) greater than 10.45 hours 

(b) between 9.75 and 10.50 hours 

(c) within 0.25 hours of the population mean 

(d) lower than the population mean by 0.50 hours or more. 


A random sample of size 121 is taken from a normal population with o = 27.5. What is the probability 
that the mean of the sample will differ from the mean of the population by 3 or more either \ ay? 


The diameters of ball bearings manufactured by a company are normally distributed with mean 
12 mm and standard deviation 0.1 mm. A sample of 25 ball bearings was taken each day during a 
given month. Find 


(a) the measured standard deviation of the distribution of the sample means 
(b) the probability that the sample mean of ball bearings will 
(i) exceed 12.01 mm 
(ii) be less than 11.98 mm 
(iii) lie between 11.98 and 12.1 mm 
A random sample of size of 81 is taken from a normal population with W = 61.4 and 6 = 7.65. What 
is the probability that the mean of the sample will 
(a) exceed 62.9 
(b) fall between 60.5 and 62.3 
(c) be less than 60.6 
A random sample of 300 air compressors had a standard deviation of 50 months. Determine the 


probability that the estimate on the population mean lifetime of these compressors will be within 6 
months of the true mean from the sample estimate. 


According a particular survey, the standard deviation of the lengths of time that men with one job 

are employed during the first 10 years of their career is 100 weeks. Length of time employed during 

the first 10 years of career is left-skewed variable. For this variable. find the following: 

(a) the sampling distribution of the sample mean for simple random samples of 100 men with one 
job 

(b) the probability that the sampling error made in estimating the mean length of time employed 
by all men with one job by that of a random sample of 100 such men will be at most 25 
weeks. 


In a large university, it was found that the score in statistics class with large number students 
over a number of years has a distribution with mean 80 and standard deviation 16. A random sample 
of 64 students of every class for 20 classes is taken. For the sampling distribution of mean, find 
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P4.17 


P4.18 


P4.19 


P4.20 


P4.21 


P4.22 


P4.23 


(a) the mean and standard deviation 


(b) the probability that 


(i) X >84 
(ii) X >86 
(iii) X <80 


where X is the sample mean found for the sample size of 64. 

According to a particular survey, the mean annual salary of private classroom teachers is $45,000 

and standard deviation of $9000. 

(a) find the sampling distribution of the sample mean for sample size of 81 

(b) repeat part (a) for samples of 289 

(c) is the assumption that classroom teachers salaries are normally distributed to answers in 
parts (a) and (b) necessary? 

(ad) what is the probability that the sampling error made in estimating the population mean salary 
of all classroom teachers by the mean salary of a sample of 81 classroom teachers will be at 
most $1000 

(e) repeat part (d) for samples of size 289. 

The weight of electronic components packed in certain containers is a random variable with a 


mean weight of 16 g and a standard deviation of 0.6 g. If the containers are shipped in boxes of 36, 
find, approximately, the probability that a randomly picked box will weigh over 585 g. 


According a particular survey it was found that the standard deviation of the lengths of hospital 
stay on the intervention ward is 9 days. 


(a) for the variable “length of hospital stay”, determine the sampling distribution of the sample 
mean for samples of 100 patients on the intervention ward 


(b) the distribution of the length of hospital stay is right-skewed. Does this validate or invalidate 
the results found in part (a)? 


(c) find the probability that the sampling error made in estimating the population mean length of 
stay on the intervention ward by the mean length of stay of a sample of 100 patients will be 
at most 3 days. 


The proportion of employed men between the ages 21 and 40 years in a city is 2/3. Suppose random 
samples of size 25 are drawn with replacement from all men in that city between the ages 21 and 40. 


What are the mean and standard deviation of the proportion p for all such samples? 
In a particular survey in a large metropolitan city nearly 20% of all students were at least somewhat 
afraid of being attacked in or nearby their schools. Suppose 20% of all students in grade 10 are 


afraid of such attacks. If p is the proportion of students in a random sample of 49 of grade 10 


students who fear such attacks, find the mean and standard deviation of p. 


Reter to Problem P4.20. Suppose the city has 250 men between ages 21 and 40 years, and the 
sampling is without replacement. What are the mean and standard deviation of pe 


Suppose 25% of all workers in a state belong to a labour union. What is the probability that in a 
random sample ot 100 workers in that state, at least 18% will belong to a labour union? 
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- REVIEW QUESTIONS 


ir 
2. 


10. 


11. 


12. 


Explain the meaning of a population distribution and a sampling distribution. 
What is sampling error? Does such an error occur only in a sample survey or can it occur both in a 
sample survey and a census? 


Explain the meaning of nonsampling error. Do such errors occur only in a sample survey or can 
they occur both in a sample survey and a census? 


Describe the condition or conditions that must hold true for the sampling distribution of the sample 
mean to be normal when the sample size is less than 30. 


Describe the central limit theorem (Refer to Chapter 3, Section 3.8). 

If all possible samples of the same (large) size are selected from a population, what percentage of 
all the sample means will be within 2.5 standard deviations of the population mean? (Chapter 3. 
Section 3.9). 


If all possible samples of the same (large) size are selected from a population. what percentage of 
all the sample means will be within 1.5 standard deviations of the population mean? (Chapter 3. 
Section 3.9). 

If all possible samples of the same (large) size are selected from a population, what percentage of 
all the sample means will be within 2.0 standard deviations of the population mean? (Chapter 3. 
Section 3.9). 

If all possible samples of the same (large) size are selected from a population. what percentage of 
all the sample means will be within 3.0 standard deviations of the population mean? (Chapter 3, 
Section 3.9). 

Define the following terms: 

(a) Consistent estimator 

(b) Estimator 

(c) Mean of p 


(d) Meanof x 

(e) Population proportion, p 
(f) Unbiased estimator 
Explain the following terms: 

(a) Sample proportion 

(b) Sampling distribution of p 
(c) Sampling distribution of x 
(d) Standard deviation of p 


(e) Standard deviation of x 

Describe very briefly the following distributions: 
(a) t-distribution 

(b) F-distribution 

(c) chi-square distribution 
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ah STATE TRUE OR FALSE 7 i 


i 


i) 
(Ss) 


Sampling error is the error resulting from using a sample to estimate a population characteristic. 


(True/False) 
For a variable x and a given sample size, the distribution of the variable ¥ is called the sampling 
distribution of the population mean. (True/False) 
The larger the sample size, the smaller the sampling error tends to be in estimating a population 
mean, Lt, by asample mean x . (True/False) 
For a sample of size n, the mean of the variable X¥ equals the mean of the variable under 
consideration. (True/False) 
For samples of size n, the standard deviation of the variable x equals the standard deviation of 
the variable under consideration multiplied by the square root of the sample size. (True/False) 
A normal distribution is determined by the mean and standard deviation. (True/False) 
The mean of all possible sample means (i.e., of the variable x ) always equals the population mean. 

(True/False) 
The larger the sample size, the larger is the standard deviation x . (True/False) 
The smaller the standard deviation of ¥ , the more closely the possible values x of (the possible 
sample means) cluster around the mean of x . (True/False) 
For a relatively large sample size, the variable x is approximately normally distributed.(True/False) 
The total area under X’-curve equals 1. (True/False) 
A X-curve starts at 0 on the horizontal axis and extends indefinitely to the right, approaching, but 
never touching, the horizontal axis as it does so. (True/False) 
A X-curve is left skewed. (True/False) 
As the number of degrees of freedom becomes larger, X'-curves look decreasingly like normal curves. 

(True/False) 
The larger the standard deviation of x , the more closely the possible values of x (the possible 
sample means) cluster around the mean of xX . (True/False) 


For samples of size n, the variable "7 -( 5 Ss” has the chi-square distribution with n — 1 degrees 
=e 


of freedom. (True/False) 
A variable is said to have a chi-square distribution if its distribution has the shape of a special 
type of right-skewed curve, called a chi-square curve. (True/False) 
Different chi-square distributions are identified by their degrees of freedom. (True/False) 
The total area under an F-curve equals 1. (True/False) 
An F-curve starts at 0, on the horizontal axis and extends indefinitely to the left, approaching, but 
never touching, the horizontal axis as it does so. (True/False) 
An F-curve with df = (v,, v,), the F-value having area to its left equals the reciprocal of the 
F-value having area © to its right for an F-curve with df = kes, v,). (True/False) 
An F-distribution and its corresponding F-curve are identified by stating its two numbers of degrees 
of freedom. (True/False) 


An F-distribution has two numbers of degrees of freedom. (True/False) 
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The first degree of freedom in F-distribution is called the degrees of freedom for the numerator. 


(True/False) 
The second degree of freedom in F-distribution is called the degrees of freedom for the denominator. 
; (True/False) 
A X-curve looks increasingly like a normal curve as the number of degrees of freedom becomes 
larger. (True/False) 
An F-curve is right-skewed. (True/False) 
For an F-curve with df = (15, 5), the F-value having area 0.05 to its left equals the reciprocal of the 
F-value having area 0.05 to its right for an F-curve with df = (5, 15). (True/False) 
The observed value of a variable having an F-distribution must be greater than or equal to 1. 
(True/False) 
The total area under a t-curve equals 1. (True/False) 
A t-curve extends indefinitely in both directions, approaching, but never touching, the horizontal 
axis as it does so. ; (True/False) 
A t-curve is not symmetric about 0. (True/False) 


As a number of degrees of freedom becomes larger, t-curves look increasingly like the F-distribution 
curve. (True/False) 


ANSWERS TO STATE TRUE OR FALSE 


1. True 2. False 3.True 4. True 5.False 6. True 7. True’ 8. False 9. True 10. True 
11. True 12. True 13. False 14. False 15. False16. True 17. True 18. True 19. True 20. False 
21. True 22. True 23. True 24. True 25. True 26. True 27. True 28. True 29. False 30. True 
31. True 32. False 33. False 


TO 


CHAPTER 


Estimation 


Inferential statistics is the part of statistics that helps us to make decisions about some characteristics of a 
population based on sample information. Estimation is a procedure by which numerical value or values are 
assigned to a population parameter based on the information collected from a sample. 


In inferential statistics, u is called the true population mean and p is called the true population 
proportion. The value(s) assigned to a population parameter based on the value of a sample statistic is 
called an estimate. The sample statistic used to estimate a population parameter is called an estimator. 


If the mean of the sampling distribution of a statistic equals to the corresponding population parameter, 
the statistic is called an unbiased estimator of the parameter, otherwise it is called a biased estimator. The 
corresponding values of such statistics are called unbiased or biased estimates respectively. If the sampling 
distributions of two statistics have the same mean (or expectation), the statistic with the smaller variance is 
called an efficient estimator of the mean while the other statistic is called an inefficient estimator. The 
corresponding values of the statistic are called efficient or inefficient estimates respectively. 


Statistical inference is divided into two major categories: (1) Parameter estimation and (2) Hypothesis 
testing (see Chapter 6). Parameter estimation is further classified into two major types: (1) Point estimation 
and (2) Interval estimation. 


5.1 POINT ESTIMATION 


Estimation is a procedure by which numerical value or values are assigned to a population parameter based 
on the information collected from a sample. In inferential statistics, is called the true population mean 


and p is called the true population proportion. The value(s) assigned to a population parameter based on 
the value of a simple statistic is called an estimate of the population parameter. 


If the population is small, we can ordinarily determine pt exactly by first taking a census and then 
computing 1 from the population data. But if the population is large, as it often is in practice, taking a 
census is generally impractical, extremely expensive, or impossible. Nonetheless, we can usually obtain 
sufficiently accurate information about yt by taking a sample from the population. 


An estimate of a population parameter given by a single number is called point estimate of parameter. 
An estimate of a population parameter is given by two numbers between which the parameter may be 
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considered to lie is called an interval estimator of the parameter. The sample statistic used to estimate a 


population parameter is called an estimator. Hence, the sample mean, X is an estimator of the population 


mean, Lt and the sample population, p is an estimator of the population proportion, p. 


The desired estimators depend on the distribution used. These estimators are not always given by X 
2 . . . . . , 
and $*. The common distributions and their estimators are given below. 


Distribution | Parameter 


Suggested Estimator(s) 


Poisson Xr A= YX 
Binominal P p= X 
Normal LL, O° 6° = S’ (denominator is n — 1) | 


After a sample is taken and the data are analysed, the result is an estimate. 
The estimation procedure involves the following steps: 

1. Select a sample 

2. Collect the required information from the members of the sample 

3. Calculate the value of the sample statistic 

4. Assign value(s) to the corresponding population parameter 


If we select a sample and compute the value of the sample statistic for this sample. the value gives the 
point estimate of the corresponding population parameter. 


5.2 INTERVAL ESTIMATION 


An interval estimate for a population parameter is called a confidence interval. In interval estimation, instead 
of assigning a single value to a population parameter, an interval is constructed around the point estimate 
and then a probabilistic statement that this interval contains the corresponding population parameter is 
made. This probabilistic statement is given by the confidence level. An interval that is constructed based 
on the confidence level is called a confidence interval. The contidence level associated with a confidence 
interval states how much confidence we have that the interval contains the true population parameter. The 
confidence level is denoted by (1 — )100%. When expressed as probability, it is called the confidence 
coefficient and is denoted by (1 — a). Note that is called the significance level. 

A tolerance interval is another important type of interval estimate. For a normal distribution, we know 
that 95% of the distribution is in the interval p — 1.96 6, w+ 1.96 0. 

Confidence and tolerance intervals bound unknown elements of a distribution. A prediction interval 
provides bounds on one (or more) future observations from the population. 


Summarising, the purpose of the 3 types of interval estimates are: 

A confidence interval bounds population or distribution parameter (such as the mean weight of a person). 
A tolerance interval bounds a selected proportion of a distribution. 

A prediction interval bounds future observations from the population or distribution. 


A point estimate of a parameter consists of a single value with no indication of the accuracy of the 
estimate. A confidence interval consists of an interval of numbers obtained from a point estimate of the 


206 // Probability and Statistics for Scientists and Engineers // 


parameter together with a percentage that specifies how confident we are that the parameter lies in the 
interval. 

A confidence interval estimate of a parameter consists of an interval of number obtained from the point 
estimate of the parameter together with a ‘confidence level’ that specifies how confident we are that the 
interval contains the parameter. This is superior to a point estimate because it provides some information 
about the accuracy of the estimate whereas a point estimate does not. 

Interval estimates indicate the precision or accuracy of an estimate and are therefore preferable to point 
estimates. A statement of the error of precision of an estimate is often called its reliability. An interval 
estimate consists of two sample statistics, L and U. The probability that the interval formed by L and U 
contains the true value of a parameter is | — &. For instance, to construct an interval estimate on the parameter 
6, we find two statistics, L and U, such that 

P(LSO<U)=1-a 8) 

The resulting interval [L < 0 < U] is called a 100(1 — «)% confidence interval on unknown parameter 0. 
If a large number of independent samples were taken from the population being considered, approximately 
100(1 — a&)% of the intervals formed would be expected to contain the true value of 8. Confidence intervals 
that contain both an L, or lower side, and a U, or upper side, are called two-sided confidence intervals. 
Confidence intervals that contain only a lower or an upper side are called one-sided confidence intervals. 
A one-sided 100(1 — «)% confidence interval on @ is given by 

GE's) 
with the probability property 
P(LS8)=1-a (5.2) 

Similarly, the one-sided upper 100(1 — «)% confidence interval on 8 is given by 8 < U with the probability 
property. 

P@<U)=1-a Gp) 

The length of the observed confidence interval is an important measure of the quality of the information 
obtained from the sample. The half-interval length 8 — L or U — @ is called the accuracy of estimator. The 


longer the confidence interval, the more confident we are that interval actually contains the true value of 0 
and the less information we have about the true value of 0. 


Example E5.1 


The following readings give the weights of 6 persons (in kg) picked at random from college students in a 
particular college: 50, 52, 55, 60, 65 and 70. Find estimates of the following: 


(a) the true (population) mean weight of all the students 
(b) the true variance of weights of all the students 
(c) the true standard deviation of weights of all the students. 


SOLUTION: 


-- espe 
iC = —= Pas = 58.6667 kg 


2(X; —X)° = (50-58.6667)? + (52 - 58.6667)? +-+- = 303.3333 
X(X;-X) _ 303.3333 
nl 6-1 


Standard deviation = «/ variance = 60.6667 = 7.7889 kg 


(a) The estimate of the true mean weight is X , that is, 58.6667 kg 
(b) The estimate (in kg’) of the true variance is 


Variance = 


= 60.6667 kg’ 


X(X;,- XY 


n- 


= 60.6667 


(c) The estimate (in kg) of the true standard deviation is 60.6667 = 7.7889 kg 


Example E5.2 


A simple random sample of 44 engineers in New York city yielded the following data on their monthly income 
from employment (in thousands of dollars): 


8 14 8 11 7 4 a gi 2 
12 6 7 f 4 8 12 
12 6 9 6 5 i 3 10 10 
2 8 8 10 >) 6 3 8 ll 


4 5 if 6 11 i 6 8 


(a) use the data to obtain a point estimate for the mean monthly income of all engineers working in 
New York city 


(b) is your point estimate in part (a) likely to equal t exactly? 


SOLUTION: 
(a) XX,=334, andn=44 
Hence X =334/44=7.6 


(b) tis not likely that X is exactly equal to Ls (population mean). Some sampling error is to be expected. 


5.3 CONFIDENCE INTERVAL ON MEAN, VARIANCE KNOWN 


This section presents methods for using sample data to find a point estimate and confidence interval estimate 
of a population mean. The key requirement in this section is that in addition to having sample data, we also 


know o, the standard deviation of the population. 
For obtaining a confidence interval for 1, in this section, we make the following assumptions: 
1. The sample consists of n independent observations, that is, any one observation does not influence 
any other. 
2. The underlying population is normally distributed with mean 1, which is, of course, unknown 
(n> 30). 
3. The population standard deviation o is known. 
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Let X be a random variable with unknown mean ut and known variance 6°, and suppose that a random 
sample of size n, X,, X,, .... X, is taken. A L00(1 — 0)% confidence interval on [1 can be obtained by considering 


the sampling distribution of the sample mean X. The mean of X is wand the variance 0’/n. Therefore, the 
distribution of the statistic 


el 
o/vn 


is taken to be a standard normal distribution. 


Tam 


The distribution of Z = (x = 1)/(o/vn) is shown in Fig. 5.1. From this figure we note that 


X- 
or 1 dN VL ap, =Z, 15 n= 1 
| a/ o/Vn on} 
This can be arranged as 
PAR = Zy0/Vn Ss X + 4).6/Vn} = 1-o1 (5.4) 


From Eggs. (5.1) and (5.4), we note that the 100(1 — &)% two-sided confidence interval on p is 


eye pons igsy. co vaynod Ae (5.5) 


Fig. 5.1: The distribution of z 


From Eq. (5.4), we observe that X will differ from u by at most oa 7 with probability (1 — a). The 
n 


CG . 
quantity </> a is, therefore, called the maximum error of estimate of i at the (1 — &)100 per cent level 
n 


The maximum error of estimate of (1 is commonly called the margin of error or the sampling error. 
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Hence the margin of error of estimate of lu at the (1 — «)100 per cent level is the maximum error in 
estimating and is given by 


2 te) 
margin of error = Zy/2 a 


= Gh ke So 
The interval [x ie eG: A ae a is a random interval and the probability that it covers the 
i 


mean | is (1 — @). The end points 


= fo) 

L=X err, 

and U=K tu -= 
Vn 


of the interval are random variables. Their values depend on the value of X which, in turn, depends on the 
sample values. 


Therefore, a confidence interval for 1 is often given simply by specifying the limits as 


Xt 


Oo 
Za/2 
Vn 
where X is the sample mean based on n observations. 


The above construction of the confidence interval for 4 was based on the assumption that the 
population is normally distributed. This assumption was necessary since it permitted us to proceed by 


stating that X na <= | has a normal distribution. If 1 is large (at least 30), the assumption of a normal 


o/vn 


oe No 
distribution is crucial, because the central limit theorem allows us to proceed by stating that — has 
O/vn 


approximately a normal distribution (we still get the same confidence limits given above, but now they are 
approximate limits). When we say that the confidence interval is exact, we mean that the true confidence 
level equals 1 — a; similarly, when we say that the confidence interval is approximately correct, we mean 
that the true confidence level only approximately equals 1 — © 

The procedure to find a confidence interval for a population mean when the standard deviation is 
known is shown in Table 5.1. 


Example E5.3 


Refer to Example E5.1. Construct a 90% confidence interval for , the true mean weight of the students. 


SOLUTION: 
From Example E5.1, we have 
n=6, X =58.6667 kg and o = 7.7889 kg. 
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For a 90% confidence interval, o = 0.10. Thus, z,,,, = Zo; = 1-645 (from the table in Appendix-£). The confidence 
interval is given by 


7.7889 
77889 < 11 < 58.6667 +1.645 
V6 V6 


or 58.6667 — 5.2308 < pt < 58.6667 + 5.2308 
53.4359 su < 63.8975 


which can be simplified to 53.44 < u < 63.90, giving the confidence interval (53.44, 63.90). Hence, with 90% 
confidence we estimate that the mean weight u of the students in that college is between 53.44 and 
63.90 kg. 


The term normal population is used here as an abbreviation for “the variable under consideration is 
normally distributed”. The z-interval procedure works reasonably well even when the variable is not normally 
distributed and the sample size is small, or moderate, provided the variable is not too far from being normally 
distributed. For large samples, say, of size 30 or more (n = 30), the z-procedure can be used essentially 
without restriction. For samples of moderate size (15 <n < 30), the z-interval procedure can be used unless 
the data contains outliers or the variable under consideration is far from being normally distributed. For 
small samples, say, of size less than 15 (nm < 15), the z-interval procedure should be used only when the 
variable under consideration is normally distributed or very close to being so. If outliers are present then 
their removal is justified in a data set in order to use the z-interval procedure. 


58.6667 — 1.645 


Table 5.1: Procedure to find a confidence interval for a population mean when o is known 


Assumptions: 

1. Simple random sample 

2. Normal population or large sample 
3. oknown 


Step 1: For a confidence level of 1 — a, use the table in Appendix-E to find z,,. 
Step 2: The confidence interval for is from 


- 0) = oO 
OFS aN to X +z == 
‘OL Vn a /2 Jn 
where Z,,2 is found in step 1, n is the sample size and X is computed from the sample data. 
Step 3: Interpret the confidence interval. 


The confidence interval is exact for normal populations and is approximately correct for large samples 
from non-normal populations. 


Example E5.4 


Refer to Example E5.2. Assume that the recent monthly income of engineers from employment in New York 
city are normally distributed with a standard deviation of $8100. 


(a) determine a 95.44% confidence interval for the mean cost, u, of all recent engineers in New York 
city 
(b) interpret your result in part (a) 


(c) does the mean monthly income from employment of all engineers in New York city lie in the 
confidence interval obtained in part (a)? 


Explain your answer. 
SOLUTION: 


(a) The confidence interval is given by 


ats oO = 
oO wg to K+ 20). 
n 


where Xp = 2.0 (from the table in Appendix-F). 


Therefore 7.6 -2 aa) to 7.6+ 7 2-4) 
a4 Ya 
or 6.9 to 8.3 


(b) Since we know that 95.44% of all samples of 44 engineers employment monthly income have the 


property that the interval from X — 0.7 to X + 0.7 contains u, we can be 95.44% confident that the 
interval from 6.9 to 8.3 contains UL. 


(c) The confidence interval in part (a) would be exact if the population of engineers’ monthly 
employment income were exactly a normal distribution. However, since engineers’ monthly 
employment income is a discrete random variable and the normal distribution is continuous. monthly 
employment income cannot follow a normal distribution exactly. 


Example E5.5 


It has been established that in a certain packaging process, the packages are of an average weight of yt but 
that 4 changes over time as the process adjustment changes through wear. It was also found that the 
variance of the package weight is a constant at 16 kg even though the mean changes overtime. A sample of 


36 packages has revealed their mean weight X of 100 kg. Assuming that the weight of the individual 
packages are normally distributed around p, (a) find the 90% confidence limits for , (b) find the 95% 


confidence limits for pL. 


SOLUTION: 
(a) Here 1-—a@=0.90 ora=0.10 and o/2 = 0.05. 
Zyos = 1.645 (from the table in Appendix-£), «= V16 =4kg 


By computation, 


612 _ 499 1.645(4) 


L=X- 
vn 36 


= 100-—1.0967 = 98.9033 


yy = % + 20l2® — 199+ LOE | 1004 1.0967 = 101.0967 
36 
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Hence, we are 90% confident that m lies between 98.9033 and 101.0967 kg. 
(b) Here z,, = 1.96 (from the table in Appendix-E£) 


L= x — 2222 = 100- ail = 100- 1.3067 = 98.6933 
Vn 36 
U = ¥ + <al2® _ 1994 122) _ 10041.3067 = 101.3067 


Vn V36 


Therefore, we are 95% confident that [1 lies between 98.6933 and 101.3067 kg. 


One-sided Confidence Intervals 


One-sided confidence intervals for U1 are obtained by setting either L = —co or U = ~ and replacing z,, by z 
The 100(1 — o)% upper-confidence interval for is 


a 


usX+z,0/Vn (5.6) 


and the 100(1 — «)% lower-confidence interval for 1 is 


X -z,0/Vn<p (5.7) 


It is most preferable to obtain and pinpoint the parameter with 100% certainty. Due to the variability inherent 
in the population, it is hard to accomplish this. One can try to achieve the turn goal of a high level of 
confidence and a narrow interval. 


Length of the confidence interval for ju: The length of any interval from a to b is (b — a). Hence, the length 
of the confidence interval is 


Therefore, the length of the confidence interval for u 


= 2 (margin of error) 


= 27 = 
a/2 AE 
The length of the confidence interval is also called its width. 


Hence, the length of the confidence interval is twice the margin of error or the margin of error is one-half 
the length of the confidence interval. 


We also note that the length of the confidence interval for 4 does not depend on X, but it depends on z__,, 
6, and n. Hence, in order to achieve a high accuracy in estimating p (that is, a narrow confidence interval 
with high degree of confidence) then one way to accomplish this goal is to pick an appropriately large 
sample. The margin of error is the standard error of the mean multiplied by Zo Lhe length of a confidence 
interval, and thus the precision with which X estimates 1, is determined by the margin of error. Increasing 
the confidence level while keeping the sample size the same will increase the value of Zz. and hence the 
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length of the confidence interval (decrease the precision of the estimate). Increasing the sample size while 


keeping the same confidence level will decrease the margin of error and the length of the confidence interval 
(increase the precision). 


Example E5.6 


A random sample of 18 venture-capital investments in the fiber optics sector yields the following data. in 
crores of rupees: 


2.04 5.48 5.60 5.96 6.27 10.51 
4.13 5.58 5.74 5.95 6.67 8.63 
421 4.98 6.66 oe7| 8.64 9.21 


(a) determine a 95% confidence interval for the mean amount, u, of all venture. capital investments in 
the fiber optics sector. Assume that the population standard deviation is 2.04 crores of rupees 


(b) interpret your finding in part (a) 

(c) find a 99% confidence interval for u 

(d) why is the confidence interval found in part (c) longer than the one in part (a)? 
(e) which confidence interval yields a more precise estimate of .? Explain your answer. 


SOLUTION: 
(a) n= 18 and XX = 113.97 crores 
Sex 3.97 
xX =— = —— = 6.33 crores 
n 18 


The 95% confidence interval for | is 


= Oo = ie} 
n vn 


nN 
on 1.20 (from the table in Appendix-£), 
: 2.04 
63321196 oe ws 633-41.96 ee 
vi8 18 
or 5.39 to 7.27 crores. 


(b) Wecan be 95% confident that the interval from 5.39 crores to 7.27 crores contains the population 
mean venture capital investment in the fiber-optics sector. 

(c) n=18, X =6.33 and o=2.04 
of = 0.01 and Z,,, = Z, 995 = 2.575 (from the table in Appendix-E), 


= 10) — 0) 

—70,,5—= to X +2). = 

Hence xX a/2 in 1/2 vn 
2.04 (2.04) 
6.33 — 2.575 ( ) fmo.oo +20). = 


a Vis 


or 5.09 to 7.57 crores. 
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(d) The confidence interval in (c) is longer than the one part (a) because we have changed the 
confidence level from 95% in part (a) to 99% in part (c). We notice that increasing the confidence 
level from 95% to 99% increases the z,,, value from 1.96 to 2.575. The larger z-value, in turn, results 
in a longer interval. In order to accomplish a higher level of confidence that the interval contains 
the population mean, we need a longer interval. 


(e) “Seekig7E5.6. 
The 95% confidence interval is shorter and therefore provides a more precise estimate of LL. 


95% confident that y lies here 


ae 


5.39 Ut 


99% confident that 1 lies here 


5.09 Uso 


Fig. E5.6 


54 CONFIDENCE INTERVAL ON THE MEAN OF A NORMAL 
DISTRIBUTION, VARIANCE UNKNOWN 


The method presented here is valid for any arbitrating sample size but is particularly important when the 


sample size is small. For setting a confidence interval for 1, we make the following assumptions: 


1. The sample is a simple random sample. 


i) 


The observations are picked from the population under study and are independent. 

3. The population has a normal distribution. 

Here we find a confidence interval on the mean of a distribution when the variance is unknown. Suppose 
a random sample of size n, X,, X,, ..., X, is available and x and S° are the sample mean and sample variance, 
respectively. In order to use a valid confidence interval when the sample size small, we make an assumption 
that the underlying population is normally distributed. This leads to confidence intervals based on the 


t-distribution. Therefore, let X,, X,, ..., X, be a random sample from a normal distribution with unknown 
mean t and unknown variance o*. We know the sampling distribution of the statistic 


Ns 
SiNn 


is the t-distribution with n — 1 degrees of freedom. 


t 


The distribution of t = (x = ut) / (s INn ) is shown in Fig. 5.2. Letting t,, ,, be the upper 0/2 percentage 


point of the ¢-distribution with n — 1 degree of freedom, we note from Fig. 5.2, that 


Pl tin et Sty y= 18 


X-p ; 
or alee anes Sty, =1-a 5.8) 
| o/2, n-1 Ce w/2, | ( 
Rearranging Eq. (5.8) 
PYX ~tainaSIVn SWs X + tas2,n-18 Vn} = 1-0 (5.9) 


Comparing Eqs. (5.4) and (5.9), we see that a 100 (1 — O)% two-sided confidence interval on UU is 


Mee Sin swaX +4, Siva (5.10) 


A 100 (1 — a) % lower-confidence interval on p is given by 


Meaty Sin <i (5.11) 


and a 100 (1 — &) % upper-confidence interval on pi is given by 


oe es in (5.12) 


The above method assumes that the samplings are from a normal population. 


The procedure to find a confidence interval for a population mean when the standard deviation o is 
unknown, is given in Table 5.2. Properties and guidelines for use of the t-interval procedure are the same as 
those for the z-interval procedure. The f-interval procedure is robust to moderate violations of the normality 
assumptions. 

Probability density 


ae curve of t-distribution 
with n — 1 df 


Sih 
Se Oa o/2 ne ,a/2 


Fig. 5.2: t-values such that there is an area o/2 in the right tail and 
an area o/2 in the left tail of the distribution 
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Table 5.2: Procedure to find a confidence interval for a population mean when o is 
unknown 


Assumptions: 
1. Simple random sample 

2. Normal population or large sample 
3. ounknown 


Step 1: | For a confidence level of 1 — a, use the Table in Appendix-G to find ty. with df =n — 1, 
where n is the sample size 
Step 2: The confidence interval for 1 is from 


- 5 = S 
et — — ON es — 


Vn vn 


where fy,2 is found in step 1 and X and S are computed from the sample data. 
Step 3: Interpret the confidence interval. 


The confidence interval is exact for normal populations and is approximately correct for large samples 
from non-normal populations. 


Example E5.7 


Find the following confidence intervals for 1, assuming that the populations of paired differences are 
normally distributed. 


(a) n=9, X =25, S = 13, confidence level 99%. 
(b) n=26, X = 13, S=5, confidence level 95%. 
(c) n=12, X = 35, S= 12, confidence level 90%. 


SOLUTION: 
(a) The t-value for v=n-1=9-1=8d.0.f. and 0.005 area in the right tail is 3.355 (from the table in 
Appendix-G). The 99% confidence interval with t = 3.355 is 


0.005,8 
x ~tayo,yS/Nn S us x + tyj2ySivn 


25 —3.355(13)/V9 < p< 25+3.355(13)V9 
25 — 14.538 << 25 + 14.538 
10.462 < w< 39.538 
The complete statement of the confidence interval, with the associated probability is 
P(10.462 < pp < 39.538) = 0.99 
(b) Using the data given, a 95% confidence interval with tgnos 
is given 
13- 2.060(5)//26 Su<13+ 2.060(5)/.426 
13 ~2.02 <p < 13 + 2.02 
10.98 <p < 15.02 


= 2.060 from the table in Appendix-G, 
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The complete statement of the confidence interval with the associated probability is 
P(10.98 < wt < 15.02) = 0.95 


(c) Using the data given, a 90% confidence interval with to95.1; = 1-796 from the table in Appendix-G, is 
given by . 
35 —1.796(12)/V12 < uw <35+1.796(12)/V12 


35 -0.16 <u <35+0.16 
34.84 <u <35.16 


The complete statement of the confidence interval with the associated probability is 
P(34.84 < u < 35.16) = 0.90 


Example E5.8 


Measurements on the percentage of enrichment of 12 fuel rods were reported as follows: 


2.94 3S 29 215 3 ahs 
2.19 2.95 292 2.81 3.05 2.8 


Find a 99% two-sided confidence interval on the mean percentage of enrichment of fuel rods. Can we state 
the mean percentage of enrichment is 2.95%? Why? 


SOLUTION: 
Here n= 12 and XX = 34.82 


x = 2X _ 34-82 _ 5 917 
n 2 


Alsoey D(X, — X)* =05112567 


0) 
Hence S= zie Ten = A eoncaa = 0.10116 
n-1 12-1 


The 99% two-sided confidence interval on the mean per cent enrichment is found from 


a Ss = Ss 
X —t9.005,11 (=| SUSX + bo 0511 (=| 


For a =0.01 andn=12,t,, = = 3.106 (from the table in Appendix-G). 


0,005, 11 


0.10116 0.10116 
: — 3.106 < < 2,9017+-3.106{ 
Therefore 2.9017 -—3 cin rv a 


2.9017 — 0.896625 


or 2.005 < p < 3.798325 | 
We can state that the mean percentage of enrichment 2.95 is included in the 99% two-sided confidence 


interval. 
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5.5 CONFIDENCE INTERVAL ON THE VARIANCE OF A NORMAL 


DISTRIBUTION 


Here, we accept S? as providing a good estimate for 6°. For setting a confidence interval for o°, we make 
the following assumptions: 
1. The observations are independent. 
2. The parent population has a normal distribution. 
3. The sample is a simple random sample. 
Suppose that X is normally distributed with unknown mean [t and unknown variance GO. UetX Xo. 
be a random sample of size n, let S° be the sample variance. We note that the sampling distribution of 


is chi-square with n — 1 degrees of freedom. This distribution is shown in Fig. 5.3. 


Probability density curve 
of chi-square distribution 
with n— 1 df 


o/2 


2 
~1,1-0/2 Xn-1,1-a/2 


Fig. 5.3: Chi-square values such that areas 1 — o/2 and o/2 are to their right 


We observe from Fig. 5.3 that, 


eee a Ge eee, =1-a 


2 n-1)S? | 
or Pi aaa = — = | er Ch (5.13) 


Equation (5.13) can be rearranged to give 


(2=DS? ¢ gr < a-1)S? | _ 
; <o ae (5.14) 


> 


Xel2,n-1 Gane 

Comparing Eqs. (5.14) and (5.4), we see that a 100 (1 — a) % two-sided confidence interval on 6? is 
2 ae 

DS? ¢ 42 < n=DS 


7 (5.15) 


(n— 
oe D 
Xo/2,n-1 X1-a/2.n-1 


when S~ is the sample variance based on n observations from a normal population. 
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In order to find a 100 (1 - «)% lower-confidence interval on 6°, set U = co and replace ‘ia with Nien iis 
giving 


el) 5” < o : 
Xo I (9.16) 


The 100 (1 — «)% upper-confidence interval is found by setting L = 0 and replacing Kemet, eth 


: 
Xi-a,n—-1 , resulting in 


2 (5.17) 


ineall thetabove Eqs. (5.13 to 5.17), eons a/2 and eas represent values from the chi-square 
distribution with m— 1 degrees of freedom such that they leave, respectively. area of | — /2 and @/2 to 
their right, as shown in Fig. 5.3. The procedure to find a confidence interval for a population standard 
deviation is given in Table 5.3. 


Table 5.3: Procedure to find a confidence interval for a population standard deviation, o 


| Assumptions: 
1. Simple random sample 
2. Normal population 


Step 1: For a confidence level of 1 — a, use the table in Appendix-F to find y,, .and x, . with 


df=n-1 
Step 2: The confidence interval for o is from 
CS ae u =I) 5 
Xa/2 Xi-a/2 


where X%;,,,. and X;,,, are found in step |. 1 is the sample size and S is computed from the 


sample data obtained 
Step 3: Interpret the confidence interval. 


Example E5.9 


Data are collected on the driving time required to reach place A from place B. The driving time is assumed 
to follow a normal distribution. Twenty-one driving times are collected and the sample variance is calculated 
to be 2 hours. A 99% confidence interval on the true variance is desired. 


SOLUTION: 
The two-sided confidence interval is given by (Appendix-F) 


(n= DS? — 2 < (H=DS? 


iar 2 
Xi-a/2,v 


B 
Xa/2.v 


where v=n-—1. 
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From the table in Appendix-F with v = 21 — 1 = 20 and o/2 = 0.005, we find that Xo.005.20 = 39.997 = %0.995.20 
= 7.434. 
Therefore, 
Ci eeey 2 ISDE) or 2<o0* <10.761 
39.997 7.434 
The complete statement of the confidence interval with the associated probability is 


P(2 hrs < 0? $ 10.761 hrs) = 0.99 


Example E5.10 


The capability of a specific gauge can be studied by measuring the weight of paper. The data for repeated 
measurements of one sheet of paper are given in Table E5.10. Construct a 95% one-sided, upper confidence 
interval for the standard deviation of these measurements. 


Table E5.10 


SOLUTION: 
A 95% upper-confidence interval is found from Eq. (5.17) as follows: 
2 
ee < (n — 1S 


2 
X0.95.14 


n= 15, S = 0.001354 and Wands = 6.57 (from the table in Appendix-F). 
DX, = 52.094, 2(X, -X)? = 0.000901 


HOG av or 
S= yo = 0.001354 and S? = 1.83 x 10° 
n> 


o? = 140.83 x10°) 
6.57 
o? < 3.9073 x 10° 


The above can be converted into a confidence interval on the standard deviation o by taking the square 
root of both sides, resulting in o = 0.00197667. 


Therefore, at the 95% level of confidence, the data indicates that the process standard deviation could be 
as large as 0.00197667. 


5.6 CONFIDENCE INTERVAL ON A POPULATION PROPORTION 


We will now give an interval estimate for population proportion under the following assumptions: 
1. The sample consists of n independent observations. 
2. The sample size is large. 


3. The population proportion is not too close to 0 or 1. 
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If a random sample size n has been taken from a large (possibly infinite) population, and X(< 1) 
observations in this sample belong to a class of interest, then p= X/n is the point estimator of the 
proportion of the population belonging. n and p are the parameters of binomial distribution. The sampling 
distribution p is approximately normal mean p and variance p(1 — p)/n, if p is not too close to either 0 or 1. 
and if n is relatively large. Therefore, the distribution of 


z-—P=P 


[pQ=p) 
n 


is approximately standard normal distribution. 


We also observe that 


a2 o/2 
or Pee ee ey | 0 (5.18) 
Play) 


Rearranging Eq. (5.18) 


re p(l- p) e = 
Pl pcan B p =) pcan POP) 1 (5.19) 


The quantity of ./p(l— p)/n is called the standard error of the point estimator p. Replacing p by p 
in the standard error, giving an estimated standard error. 


[PU- 2B [pap 
Pps ao Do pe SS (5.20) 


The above formula, Eq. (5.20), is applicable only if the observed sample proportion is not too close to 
O or 1 and the observed number of successes X and the observed number of failures (1 — X) both exceed 5. 
If these conditions are not met, the procedure is not recommended. 


The above procedure depends on the adequacy of the normal approximation to the binomial. 
Conservatively speaking, this requires that np and n(1 — p) be greater than or equal to 5. In situations 
where the approximation is inappropriate (especially when n is small) other methods must be used. Tables 
of binomial distribution could be used to obtain a confidence interval for p. 


We may find approximate one-sided confidence bounds on p by a simple modification of Eq. (5.20). 


The approximate 100(1 — «)% lower and upper confidence bounds are 
; [pd-p a [PA p) 
P= Se P<, and pS pt Z, z 
respectively. 


The procedure to find a confidence interval for a population proportion, p is given in Table 5.4. 


lA 


222 // Probability and Statistics for Scientists and Engineers // 


Table 5.4: Procedure to find a confidence interval for a population standard proportion, p 


Assumptions: 


1. Simple random sample 
2. The number of successes, x, and the number of failures, n — x are both 5 or greater 


par 


Step 1: | For a confidence level of 1 — a, use the table in Appendix-E to find Zo 
Step 2: The confidence interval for p is from 

p(1— p) ‘ p(1— p) 

see is Pt Za/2 is 


A 


P~ Zo12 
n n 
where z,,2 is found in step 1, 7 is the sample size and p= x/n is the sample proportion 
Step 3: Interpret the confidence interval. 
Example E5.11 


In a random sample of 300 automobile crankshaft bearings, 12 have a surface finish that is rougher than the 


specifications allow. 
(a) Calculate a 95% two-sided confidence interval on the fraction of defective crankshafts produced 


by this particular process. 
(b) Calculate a 95% upper confidence bound on the fraction of defective crankshafts. 


SOLUTION: 
(a) 95% confidence interval on the fraction defectives produced with the process. 


p = 2. =0.04,n = 30, z,, = 1.96 (from the table in Appendix-E). 


300 
i, pip) , P(— p 
P= Zei0 ee SPS. pt ae AP) 
n n 
0.04(1 — 0.04) 0.041 — 0.04) 


0.041.964 ee eioeiion 
300 P 300 


0.01 — 0.022175 < p< 0.04 + 0.02275 
0.017825 < p < 0.062175 

(b) 95% upper confidence bound: 
Zu = 2y95 = 1.65 (from Appendix-E) 


p< 0.04+1.65 tiles Pee) 
300 


or < 0.04 + 0.018668 
Hence p<0.058668 
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Example E5.12 


A sample of 300 of electronic components produced showed 36 non-conforming ones. Construct a 95% 
confidence interval for this. Assume binomial distribution. 


SOLUTION: 


A two-sided confidence interval on p is given by 


,_.  |PU=B) : p— p 
DP <e3 ‘4 =P = Pp sae : i 


Since n is large and the underlying distribution is binomial, p = 36/300 = 0.12 


A 95% confidence interval, with Z);= 1.96 (from the table Appendix-E) is given by 


0.12 -—1.96 pees) << 012196 CO O88) 
300 ane 
. 0.12 ~ 1.96 (0.01876) <p < 0.12 + 1.96 (0.01876) 


0.12 — 0.0368 < p< 0.12 + 0.0368 

0.0832 < p < 0.1568 
The complete statement of the confidence interval with the associated probability is 

P(0.0832 < p < 0.1568) = 0.95 
Margin of Error: 
The margin of error from Eq. (5.20) is given by 

p(l— Pp) 
n 


p= P—)—. 1 eos) ‘ 
It can be shown that the maximum value of <y,/2 . 1S 2q)2 Te and occurs when p= Fe: This 


called the safe or conservative margin of error, whatever the actual value of p in the population. 


margin of error = Z,/2 


1 


Hence, conservative margin of error = Z,/2 ee 


Example E5.13 


Determine the margin of error in Example E5.12. 


SOLUTION: 

The margin of error is given by 
p(l- Pp) 
margin of error = Z,/2 ee 


Z,» = 1.96 (from the table in Appendix-E), p = 36/300 = 0.12 
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(0.12 jis=l0 ED 
Hence, margin of error = 1.96, a = 0.0368 


5.7 CONFIDENCE INTERVAL ON THE DIFFERENCE IN TWO MEANS, 


VARIANCE KNOWN 


The procedure presented in this section for finding a confidence interval is valid if the following assumptions 
are justified: 
1. The two populations are normally distributed. This assumption is not very important if both sample 
sizes are at least 30 (large sample size). 
The standard deviations of the two populations are known. 


3. Two random samples are picked, one from each population. They are independent, that is, any 
outcome in one sample does not influence any outcome in the other sample. 


4. Within each sample the outcomes are independent. 
Consider two independent random variables X, with unknown mean y, and known variance Gi and X, 
with unknown mean iL, and variance G; . We wish to find a 100 (1 — «)% confidence interval on the difference 


in means Pe U- Der Ge see Xin, be a random sample of n, observations from 2 and A. 


x aan Xn, be 


a random sample of n, observations from X,,. If x , and y , are the sample means, the statistic 
X; — X. — (Uy =U) 
Cee 
~ = 
mh Ny 


hem 


is standard normal if X, and X, are normal or approximately standard normal if the conditions of the central 
limit theorem apply, respectively. Also, 6, and 6, are the population standard deviations. From Fig. 5.2 it is 
clear that 


or 


where z,,, is the upper O/2 percentage point of the standard normal distribution. 


Rearranging, we have 


2 2 2 2 
- = 0, Oo = s fy 0 
X,—X>— Zoli — Sp, —~Deee Xe eee 
- Ale n, Ny 1 Ls 1 2 a/2 n, Ny 1 Q (521) 


Comparing Eggs. (5.4) and (5.21), we note that 100(1 — «&)% confidence interval for Ht, -H, is 


P 
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a ee —— 
ey Gea 1 2 SS oP Oo 
By - Xa— Za +—* SW My SX, —X_ 4+ Zy)9,/ ++ (5.22) 
Le mh Nh 


One-sided confidence intervals on U, — MW, may also be obtained. A 100 (1 — «)% upper-confidence 
interval on |, — LL, is 


2 2 
— GS 
My -H, SX, -X,4+2,,J++ : 
1 1 27 <q mt (5.23) 
and a 100(1 — &) % lower-confidence interval is 
2 2 
= Gia 
ee tl, IL, (5.24) 
m Ny 


The confidence level (1 — ) is exact when the populations are normal. For non-normal populations, the 
confidence level is approximately valid for large sample sizes. 


The margin of error E is 


2 2 
oO (e} 

E = Z)9,,—-+ = (5.24a) 
nh Ny 


Example E5.14 


The mean hourly wage for male workers was $15 and $13 for female workers in a particular manufacturing 
sector in the year 2000. These two estimations were based on random samples of 1000 and 1200 workers 
taken, respectively from two independent populations. The standard deviations of the two populations are 
known to be $2 and $1.50 respectively. Construct a 95% confidence interval for the difference between the 
mean hourly wages of the two populations. 


SOLUTION: 
For the 95% confidence level z,,, = 1.96 (from the table in Appendix-F). 
The 95% confidence interval for p14, — 1, is given by 


2 2 
X,-X, 421 1422 
m Ny 
Pie Nil 
15-13+1.96 + =2 + 1.96 (0.2046) = 2 + 0.4011 = $1.60 to $2.40 
1000 1200 


We have obtained ($1.60, $2.40) as an 95% confidence interval for |, — },. 


Two different formulations of oxygenated motor fuel are being tested to study their road octane numbers. 


. , 5 2 . or — 4 
The variances of road octane number for formulation A is 67 = 1.5 and for formulation B it is 6, = 1.2. Two 


random samples of sizes n, = 15 and n, = 20 are tested, and the mean road octane numbers observed are 
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X, =90 and X, = 93. Assume normality and calculate a 95% confidence interval on the difference in 
means. 


SOLUTION: 
95% confidence interval: & = 0.05, z,,, = Zo. = 1.96 (from the table in Appendix-E). 


‘0.025 


ae Io? co 
(X, ~ X2)=<ara a =, — ls 
ny As 
LS ie LESieelee 
— J! — + <u, -—u, <90-—93+4+1.96,/— + — 
ae 1529 01 1 15 20 


3 — 0734S eo 04 
or 3.784 Sl, —p, S—2.216 
With 95% confidence, we believe the mean road octane number for formulation B exceeds that of formulation 
A since 0 is not included in the confidence interval. 


5.8 CONFIDENCE INTERVAL ON THE DIFFERENCE IN MEANS OF 


TWO NORMAL DISTRIBUTIONS, VARIANCES UNKNOWN 


We make the following assumptions: 


1. The two populations are normally distributed. 
2. The population variances o; and o;, though unknown are the same, say, each equal to 0”. 
3. Two random and independent samples are picked, one from each population. 


Consider two independent normal random variables, say X, with mean yw, and variance G; , and X, with 
. Z ry 
mean , and variance 65. Both the means 1, and 1, and the variances oO; and 6; are unknown. Assume 


that both variances are equal; that is o7 = Oo; = 0°. We wish to find a 100 (i — @)% confidence interval on 
the difference in means LL, — U,. 

Random samples of size n, and n, are taken on X, and X,, respectively; let the sample means be denoted 
by X, and X,, and the sample variances be denoted S; and Ss: . Since both Se and §5 are estimates of 


the common variance o*, we may obtain a combined (or “pooled”) estimator of 6? as the pooled estimate of 
the common population standard deviation, \ is given by 


(n, -1)S; +(n, -1)S3 
+ ie 


2 
= 


(5.25) 


To find the confidence interval for pt, — lt,, note that the distribution of the statistic 


X,- X2-(Wy—p) 


| ines 
Sa 
mh nN 


C= 
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is the t-distribution with n, +n, —2 degrees of freedom. Hence, 
Py eto ing StS tyson, 4n,-2 } alse 


or Pet 


o/2,n, +n, —2 S 


Here t/2,,,4n,-2 is the upper o/2 percentage point of the r-distribution with Hh, +n, — 2 deaiee sor 
freedom. 


Rearranging, 


— ae 1 1 = 1 | 
PRs -taraaan Spot mite AT eh iia S, on =I-c@ (Ore) 
1 2 ; 1 


Hence, a 100 (1 — «)% two-sided confidence interval on the difference in means Ue ts 


aa et - = lee 
Ai X32 — by /2am+n,-2 p aoe SU —b2 SX, ~- Xp + ty./2.m,4m-25 p sn Pe C27) 
l 2 1 2 


A one-sided 100 (1 — 01)% lower-confidence interval on 1, — LL, is 


ae | ae 
MAD — lo sn,—29 p ae, SM — Wo (5.28) 
1 Ne 


and a one-sided 100 (1 — &)% upper-confidence interval on , — 1, is 


= bel 
by, —U> SS XxX ms X2 Sein 29 p ran (5.29) 
1 2 


The margin of error E is given by 


Lat 
ey te 
my 
Since 6 is not known, we replace it with its pooled estimate S. and thereby we cannot use <-values 
from the normal table. Here, we use values from the f-distribution with n, + n, — 2 degrees of freedom. This 
is especially important if the degrees of freedom are less than 30 because then the t-values will be markedly 


different from the z-values. Hence, the marginal of error is 
| 


LN) SRT Ae a 
nm +n, — 4,0 P ny N> 


4 a (n, —1)S? + (nm) -1)S3 _ Sei a ae) 
gees oe hi, —2 Cini tae 
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where X, and X, are the sample means, based, respectively on n, and n, observations, and S, is the 
pooled estimate of o. The procedure to find a confidence interval for the difference between two population 
means, HL, and pL, is given in Table 5.5. 


Table 5.5: Procedure to find a confidence interval for the difference between two 
population means pt, and up, 


Assumptions: 

1. Simple random samples 

2. Independent samples 

3. Normal populations or large samples 
4, Equal population standard deviations 


Step 1: For a confidence level of 1 — a, use the table in Appendix-G to find fg. with df= nj, +n — 2 


Step 2: The end points of the confidence interval for [4; — [Ap are 

a ee tot 

(Xia lt tans, SS 

mh % 
3 3 

_ + — 

where S,, = (ae 
n +n, —2 


Step 3: Interpret the confidence interval. 


The confidence interval is exact for normal populations and is approximately correct for large samples 
from non-normal populations. 


Example E5.16 


The overall distance travelled by a golf ball is tested by striking the ball with a specially made mechanical 


golfer. Ten randomly selected balls of two different brands A and B are tested and the overall distance 
measured as shown below: 


BrandA 263 m7 27 eS 275 216 219 283 286 287 
Brand B 244 250° 261 9263 265 268 270 Pg 273 280 


Construct a 95% two-sided confidence interval on the mean difference in overall distance between the two 
brands of golf balls. 


SOLUTION: 
Here nn, = 10, n, = 10, ZX, = 2760, EX, = 2653 
= Sve =a kay 
5 =e 376 5 pr = EE 
n 10 : Ny 10 


X(X,; -X,)” =564 and X(X,, — X,)* = 868.1 


_y \ 
s2= ma = 62.6667 and S, = 7.9162 


nN, = 
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X(X2; ~ Xz)" 
Ser = 96.4556 and S, = 9.8212 


(n, -1)S; +(n, —1)S3 a /ao- 1)(62.6667) + (LO — 1)(96.4556) 
n +n, —2 10410 = 2 


or S =8.9197 


95% two-sided confidence interval: t/2,.n,4n,-2 = =booosig = 2-101 (from the table in Appendix-G). 


Pee 1 1 1 1 
(Xi — X2)—faramin 2p f—+ — Sth - US <(X, ~ Xo) + tyr2nsn 25> ae ae 
mh NN mh Nn 


5 Ee 1] 
(276 — 265.3) — 2.101(8.9197) 10° 19 ahh ys @76- 265.3) +2.10189197) + 
2.3191 <p, —w, < 19.0809. 


Example E5.17 


To compare the tensile strength of two materials, 12 test specimens of each material were tested. The order 
was decided at random. The higher the test value, the better the strength of these materials. The data 
obtained are summarised in Table E5.20. 


Table E5.20 
9.70 De 10.5 | 9.60 9.40 


ey 


12.5 


8.40 | 11.6 9.70 
SOLUTION: 
For the both materials 
Material A Material B 
ne ee 
2X,, = 124.8 xX, = 108 
—- 2X, 1248 — 2XX,, 108 
a ee a —————— 
aie, 12 oe nn 
Bie 1312 =x? =1010.64 
1 2 
[= (x) =izore2 (2 Py Soe 
nh, ny 


Sean) =1408 nee) Sass 
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Hence, the summary table is 


Degree of freedom 
14.08 
| 


Total: 52.72 


Material 


Therciore, d= X,—X,=1.4 


The pooled estimate of variance is $* = 52.72/22 = 2.4 


The estimated variance of the difference between the means of 


a | 2 
ie -s°(2+4) -24( 5] =0.4 
¢ non 2) 


The estimated standard deviation of d is 


S, =V04 = 0.63 


Hence, if we desire a 95% confidence interval, we must obtain f, ,,.,. 


The 95% confidence interval now be formed using the relationship 


= 2.074 (from the table in Appendix-G). 


L=d-— ty/2.n+n-2 S,= 1.40—(2.074)(0.63) = 0.09 
U=d + ly/2,n,4n,-2 S,= 1.40 + (2.074)(0.63) = 2.71 
We can be a 95% confidence that the difference between the actual means is between 0.09 and 2.71. 


When we cannot assume Oj = Gs, , we can find a 100(1 — &)% confidence interval on 1, — MW, using the fact 
that the statistic 


pe — Xi 7X2 — (Py) 
Stine Se ln 
distributed approximately as ¢ with degrees of freedom given by 
2 
(S? /n, + 8} /n,) 


(s?/m)  (s3/m) (5.30) 
Roa! 3 eed | 


Therefore, an approximate 100 (1 — 1)% two-sided confidence interval on 1, - , when o7 # 0} is 


acpeeuniey co 5 ae 52 52 
X,-X, tajayq[ bt <P s X/- oe (531) 
ee tio) hy 


Upper (lower) one-sided confidence limits may be found by replacing the lower (upper) confidence limit 
with —co (co) and changing 0/2 to a. 


The procedure to find a confidence interval for the difference between two population means, p, and pL, is 
given in Table 5.6. 
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Table 5.6: Procedure to find a confidence interval for the difference between two 
population means 1, and LL, 


Assumptions: 

1. Simple random samples 

2. Independent samples 

3. Normal populations or large samples 


Step 1: For a confidence level of 1 — ©, use the table in Appendix-G to find tuo with df = v, where 
(s? /m)+($3/m) | 
2. a eee 
Si in Sie 
(st/m) (27m) 


nl fit 


rounded down to the nearest integer. 
Step 2: The end points of the confidence interval for 11, — > are 


Interpret the confidence interval. 


Example E5.18 


The administration of a college wants to test if the mean GPAs (grade point averages) of all male and female 
students who actively participate in sports are different. A random sample of 28 male students and 24 female 
students who are actively participated in sports. The mean GPAs of the two groups were found to be 2.00 
and 2.12 respectively, with the corresponding standard deviations equal to 0.43 and 0.38. Assume the GPAs 
of all male and female students who participated in sports activities have a normal distribution with unequal 
standard deviations. Construct a 90% confidence interval for the difference between the two population 


means. 


SOLUTION: 
Given S, = 0.43, S, = 0.38, n= 28, n, = 24 
The desired confidence interval is given by 


(X,- X2)ttan 


where the degrees of freedom 


2 ae 27 
[st,st] [a ,28") 
mn Mh = 28 24 — = 49 (rounded) 


| (sy (es) [eae] 
ny, 28 24 
m) Am 


+ 
n-1 ml 25-1] thier | 


232 // Probability and Statistics for Scientists and Engineers // 


0.90 , 
Area in each tail of the r-distribution curve = 0.5-—— = 0.05. The t-value for df = 49 and 0.05 area in the 


right tail is 1.677 (from the table in Appendix-G). 
The 90% confidence interval for 1, — L, is given by 


So ase 
(2.0 -2.12) £1.677,{—- + 
Mm Ny 


(0.43) n (0.38)° 
28 24 
0.12 + 1.677(0.11234) = -0.12 + 0.19 =-0.31 to 0.07. 


or Onl a 167 7 


Example E5.19 


Assuming that two populations are normally distributed with unequal and unknown population standard 
deviations, construct a 99% confidence interval for 1, — 1, for the following data: 


Nao, A = 50.8, = 3.0, m9 Xe = 40, Seo 


SOLUTION: 


From the table in Appendix-G, f is 2.756. The desired confidence interval is given by 


0.005,22 


as Si. Se 
Ose esas a 
Poe) 
Le 
nn 
where v= z 5 
Ze a 
n H 
le 


Tiel fea 


Substituting the values for S|, S,, n, and n, and simplifying, we get n = 29. Thus, the desired confidence 


interval is given by: 
[3° 6 [3 6° 
50 — 40 — 2.756,/ —+— <p, -p, <50—40+2. —+— 
is 1 70 une 2528 1519 


10 — 2.756 (1.579) SM, — Wh, < 10 + 2.756 (1.579) 
10—4.353 Sp, — pw, S 10 +. 4.353 
5.647 S pL, — ww, S 14.353 
The complete statement of the confidence level with the associated probability is 
P(5.647 << 14.353) =99 


5.9 CONFIDENCE INTERVAL ON 2, ~, FOR PAIRED OBSERVATIONS 


In general, suppose that the data consists of pails (XX). (X .. Ay) 1:5 (AG, Ke) BOT X Pamrleeee 
assumed to be normally distributed with mean KM, and p,. ee - caidout variables er different 
pairs are independent. Since, there are two measurements on the same experimental unit, the two 
measurements within the same pair may not be independent. Consider us n differences D, = X,, - X,, 
DC», ereny 3 X,, — X,,, Now the mean of the differences D, say ve 


Mu, = ED) = DX —X,) = KX) - F(X) =p, -y 
because the expected value of X, — X, is the difference in expected values regardless of whether X, and X, 


are independent. Hence, we can construct a confidence interval on MU, — LH, by just finding a contidence 
interval on |t,. To construct the confidence interval for LL, = Hl, — LL, note that 


LL, 


Se 


follows a f-distribution with n — 1 degrees of freedom. Then, P(-t, 2 T St,,,_,) = | — Gawewaneulaeitute 
for TJ in the above expression and perform the necessary steps to isolate M, =H, — Ht, between the inequalities. 
This leads to the following 100 (1 — a)% confidence interval on , ~ ,. The differences D are normally and 
independently distributed, we can apply the t-distribution procedure described earlier to tind the confidence 


interval on [1,. By analogy with Eq. (5.10), the 100 (1 — a)% confidence interval on w= LW, — WL, is 


D~tej2.n-18D Ins py S$ D+tayoniSp ivn (5.32) 


where D and s, are the sample mean and sample standard deviation of the differences D.. respectively. 
This confidence interval is valid for the case where 6; #03. because 57, estimates oj, = VX, — X,) and tor 
large samples (n 2 30 pairs) and the assumption of normality is not necessary. 

Summarising, if g ands, are the sample mean and standard deviation of the difference of » random 
pairs of normally distributed measurements, a 100(1 — o)% confidence interval on the difference in means 
LM, =H, — H, is 

Ee i8e Vn ups Oe ye iVn 


where 7, _, is the upper 0/2% of point of the t-distribution with n — | degrees of freedom. 
2, n= 


= : , : 2 ees 2 
This confidence interval is also valid for the case where 6; # 03. since sp = V(X, — X,). Also, for large 
samples (say, 7 > 30), the explicit assumption of normality is unnecessary because of the central limit theorem. 
The procedure to find a confidence interval for the difference between two population means, [, and 


uu, is given in Table 5.7. 
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Table 5.7: Procedure to find a confidence interval for the difference between two 
population means yp, and yp, 


Assumptions: 
1. Simple random paired samples 
2. Normal differences or large sample 


Step 1: For a confidence level of 1 — a, use the table in Appendix-G to find to. with df=n—1, 
Step 2: The end points of the confidence interval for [4 — Uy are 


= My 
d tty 


Vn 


Step 3: Interpret the confidence interval. 


The confidence interval is exact for normal differences and is approximately correct for large samples 
and normal differences. 


Example E5.20 


A company wanted to find out if attending a special course on “how to be a successful salesperson” can 
increase the average sales of its employees. The company sent 10 of its salespersons to attend this course. 
The following table gives the one-week sales of these salespersons before and after they attended this 
course. 


25 
After WE es 


26 


Make a 95% confidence interval for the mean y, of the population paired differences where a paired difference 
is equal to the sales before and after attending the special course. Assume that the population of paired 
differences is approximately normally distributed. 


SOLUTION: 
ia = NO 


Difference.d | d'_| 
11 17 —6 
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Gel s@o 7 


sj=-b= = Oon74 
‘ Vn i0 


df=n—-1=10-1=9; o/2=0.025 
ee e202. 
The 95% confidence interval for 1, is 


From the table in Appendix-G, t, 


d +t5;=-5 + 2.262(0.5774) 
or 6.3060 to 3.6940. 


Example E5.21 


A college claims that the Math tutoring service offers significantly increases the test scores of students in 
mathematics. The following table gives the scores per 120 of 12 students before and after they took the 
tutoring help. 


| Before | 81 | 75 | 88 | 90 | 67 | 71 | 92_| 


| After_| 96 | 74 | 93 | 79 | 79 | 73 | 111 | 76 | 75 | 78 | 79 | 84 | 


Make a 95% confidence interval for the mean 1, of the population paired differences where a paired difference 
is equal to the score before attending the tutoring service minus the score after attending the tutoring 
service. Assume that the population of paired differences is approximately normally distributed. 


SOLUTION: 
n= 


Difference, d 
=|5' 
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).95 
df=n—1=12-1=11; w2=0.05- 2°? = 0.025 


From the table in Appendix-G, ¢,, .), = 2-201 
The 95% confidence interval for 1, is 
+ t=-5,25 + (2.201)(2.2365) 


or —10.1725 to -0.3275 


5.10 CONFIDENCE INTERVAL ON THE RATIO OF VARIANCE OF TWO 


NORMAL DISTRIBUTIONS 


Let X, and X, are independent normal random variables with unknown means pL, and w, and unknown 


variances o- and O3 . respectively. We wish to find a 100 (1 — @)% confidence interval on the ratio of 
07/03 . Let two random samples of sizes n, and n, are taken on X, and X,, and let S° and $5 denote the 
sample variances. The sampling distribution of 


pe 22 
S; /O; 


is F with n, — 1 and n, — | degrees of freedom. This distribution is shown in Fig. 5.4. 


ol2 


al2 


0 F, af2n, 1n,-1 af2.n. 1n,-1 


Fig. 5.4: The distribution of F, 


ys =il 


From Fig. 5.4, we observe that 


reser ey ie Fyne 1-o@ 
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82/02 
or LO) 
PY Farin Ss 52 / 2 = Eagle =l-a 
3; 10) 
H We o S2 
ence male 1 1 é 
P ge Fial2,n.-1 n—l = o Ss Pee = lo (33) 


Comparing Eqs. (5.33) and (5.4), we see that a 100 (1 — a)% two-sided confidence interval on Oo; Foe 


Si Sse 
Bah Ee eto, 

2 * 1-o/2,n,-1,n,-1 => eae a 

Ss My hy A s2 o/2,n,-1,n,-1 (5.34) 


where the lower 1 — /2 tail-point of the F,,-1,,,-1 distribution is given by 
F _ 1 
101! 2am lay = (5.35) 


a/2,n, = n,—1 


We can also construct one-sided confidence intervals. A 100 (1 — «)% lower-confidence limit on on / 63 is 


2 
Si F 2 OI 
2 f1-a.n,-1n, -1 = TT 5.36 
c n—1,n, o: (5.36) 


while a 100(1 — &)% upper-confidence interval on oF / om is 


2 2 
Sp 
= sn-l,n,-1 
05 Se ita (5.37) 
where Fojon,-1n,-1 29d Fy_o/2.n,-1.n,-1 are the upper and lower G/2 percentage points of the F-distribution 


with n,— 1 numerator and n, — 1 denominator degrees of freedom respectively. A confidence interval on the 
ratio of the standard deviations can be obtained by taking the square roots in Eq. (5.34). 


Example E5.22 


Two samples are drawn from two independent populations, which are normally distributed. A comparison 
of the samples is to be made by developing a 99% confidence interval on the ratio of their variances. The 


following information is known: n, =n, = 21, Se =) and Se = 8. Determine the complete statement of the 


confidence interval, with the associated probability. 


SOLUTION: 


A 100 (1 — «&)% two-sided confidence interval on of / 05 is given by 


Oe co, S; 
se F aesaiw,.v, = oo = Me Fepaeay: 
where v,=n,-1landv,=n,-1 
Therefore, F  wss2099 = 240 tO 2 niga 1/2.46 = 0.4065 (from the table in Appendix-H). 
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Confidence interval is given by 


or 


The complete statement of the confidence interval with the associated probability is 


Example E5.23 


The diameter of steel rods manufactured on two different machines is being investigated. Two random 


samples of sizes n, = 15 and n, = 17 are selected and the sample means and sample variations are: aioe 


Sim0e X 5 = 8.95 and $3 = 0.40, respectively. Assume that the data is drawn from a normal distribution. 


2 
1 


> (0. 4065) <— <2 (2.46) 


0.254 < 21 <1,538 


oo 


P{oass oe 538 


2 


Construct the following: 


(a) 
(b) 


(c) 


a 90% two-sided confidence interval on o,/o, 


a 95% two-sided confidence interval on 6,/o,. Comment on the comparison of the width of this 


interval with width of the interval in part (a) 
a 90% lower-confidence bound on 0,/0,. 


SOLUTION: 


(a) 


n, =15,n,=17 
Sposa mer = OAl2 


Saas ty lt — pie 


90% confidence interval for the ratio of variances: 


S; oi {Si 
3) Pipe pena ty a <(% Jolin ee 


Joos.iaa7 = 2-33 and fos 1446 = 0.412 (from the table in Appendix-H) 


[ps Joins St Oo; L-<( 02) 2.33 
04 ao, \04 


0.309 < 2 < 1.7475 
3: 


0.55588 < 2! < 1.32193 
07 
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(b) 95% confidence interval: 


5 


S 
al | fe} 
| S2 Fica/tn me 


Dy) 
Z 


S 
83 
So.ons.14i6 = 2-82 and f, 0.44 4¢ = 0.342 (from the table in Appendix-H) 


< 


| ae n;,—I,n,—1 


\ 


03 ome 
lea 0.342 < SL <| 75] 2.89 
0.4 G5 0.4 


0.2565 < 2 <2.115 


05 


0.50646 < 2! < 1.454304 
0» 
The 95% confidence interval is wider than the 90% confidence interval. 


(c) 90% lower-sided confidence interval: 
St on 
foitais = 9-5 (from the table in Appendix-H) 


[22 )o5s “al 
0.4, 0; 


2 
0.375<—) 
05 


or 0.61237 < 2. 


O» 


5.11 CONFIDENCE INTERVAL ON THE DIFFERENCE IN TWO 
PROPORTIONS 


Here, we construct a confidence interval for p, — p, under the following assumptions: 


1. The population proportions p, and p, are not too close to 0 or 1. 
2. Two random samples are taken, one from each population, and the two samples are independent. 


3. Sample sizes n, and n, are large. 

If two independent samples of size n, and n, are taken from infinite populations so that X, and X, are 
independent, binomial random variables with parameters (71, P,) and (11,, p,), respectively, where X, represents 
the number of sample observations from the first population that belongs to a class of interest and X, 
represents the number of sample observations from the second population that belongs to a class of interest, 
then p,= X,/n, and p,= X,/n, are independent estimator of p, and p, respectively. Considering the fact 


that the normal approximation to the binomial applies, the statistic 
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a Py — Pp — (Piz Pr) 
P\- Py) , P2(l= P2) 
ny Ny 
is distributed approximately as standard normal. An approximate 100 (1 — a)% two-sided confidence interval 
fon p, — pts 


Pid P,) , Pol Po) P\l-P:) | Po(- Po) 


Py — Pa — Za/2 < Pi — Po S By — Py + Zu2 (5.38) 
n, Ny nh Ny 
An approximate 100 (1 — «)% lower-confidence interval for p, — p, is 
: PiA-P,) | Po - Po) 
P\ — P2 ~ Za12 a + < Pr 


ny Ny 
and an approximate 100 (1 — a)% upper-confidence interval for p, — p, is 


AU-A) , Pl-pr) 
n Ny 


Pi — P2 S Py — Py + Zq)2 


where <,,, is the upper @/2 percentage point of the standard normal distribution. The procedure to find a 
confidence interval for the difference between two population proportions, p, and p, is given in Table 5.8. 
The margin of error is 


PP), Bo Pr) 
ny Ny 


E= Za/2 


Table 5.8: Procedure to find a confidence interval for the difference between two 
population proportions p, and p, 


Assumptions: 

1. Simple random samples 

2. Independent samples 

3. X, M, — Xj, X2 and ny — X; are all 5 or greater 


Step 1: For aconfidence level of 1 — a, use the table in Appendix-E to find z,,) 


Step 2: The end points of the confidence interval for p, — p> are 
PA), PQ pr) 
nh 7) 


where Z,, is the upper 0/2 percentage point of the standard normal distribution. 
Interpret the confidence interval. 


(DP; i pr)+ La/2 


Summary of confidence intervals for single population and two populations are given in Tables 5.9 and 
5.10 respectively. 


// Estimation // 241 
eS tmaation I! 241 
Example E5.24 


An investigation on body mass index (bmi — a measure of body fat based on height and weight) for adults, 
a bmi of greater than 25 indicates an above health weight (i.e., overweight or obese). Of 800 randomly 
selected adults whose highest degree is bachelors, 412 have an above healthy weight: and of 1000 randomly 
selected adults with a graduate degree, 474 have an above healthy weight. 


(a) determine a 90% confidence interval for the difference between the percentages of adults in the 
two degree categories who have an above healthy weight. 


(b) repeat part (a) for an 80% confidence interval. 
SOLUTION: 


412 
(a) Population 1: Bachelors degree, p, = 300 =().515 


~ — 474 
Population 2: Graduate degree, p, = 1000 = 0.474 


Zp = 1.645 (from the table in Appendix-£) 
90% confidence interval is 


POA), PBs) 
ny Ny 


Pi — Pr t 2a/2 


05150-0315) n 0.474(1 — 0.474) 
800 1000 


(0.515 —0.474)  1.645V9.752813x 10+ + 2.49324 x10“ 


0.041 + 0.057566 or —0.016566 to 0.098566 
(b) The 80% confidence interval is given by 


(0.515 —0.474)+1.645 


Ad—p), Pod-P) 
nh Ny 


Py — Bz tZas2 5 Z, = 1.28 (from Appendix-E) 


0.515(1—0.515) " 0.474(1— 0.474) 
800 1000 
0.041 + 0.04793 or 0.0037929 to 0.08893. 


Table 5.9: Summary of confidential intervals (single population) 


(0.515 — 0.474) £1.28 


Limits of confidence 


Nature of the Parameter on Procedure 


population which confidence interval with confidence 
intervals is set coefficient 
l-a 
. | Quantitative data; u, the population |Draw a sample of size n and) | oO 
7 th xz “m2 
standard deviation O@ |mean compute the value of X , the un 


is known; Population estimate of | 


normal 
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S 


Draw a sample of sizen and | — 
X Eby 4.0/2 i 
n 


compute X and 


S= (23,= 5) 
n-l 


Draw a sample of size n and 
compute X and S 


. |Quantitative data; 
standard deviation o 
is not known; 
population normal; 
important when 
sample size is small 
(n < 30) 

. |Quantitative data; 
standard deviation 6 
is not known; 
population not 
necessarily normal; 
sample size is large 
(n = 30) 

. |Quantitative data; 
binomial case 


u, the population 
mean 


tn-1, v2 iS the value obtained 
from the ¢-distribution with 
n— 1 degrees of freedom 


u, the population 


= S 
seh TG 
mean XZ, 


“vn 
Confidence interval is 
approximate 


Draw a sample of size n and 
note X, the number of 
successes; obtain X/n, the 
estimate of p. Here n is 
assumed large 


p, the probability 
of success (the 

population 
proportion) 


Confidence interval is based 
on the central limit theorem, 
thus, is approximate 


Ges Gas 
Nature of the Parameter Procedure Limits of confidence interval 
population on which with confidence coefficient 
confidence 
intervals is set 
Quantitative Ly — Up, the Draw samples of sizes m and 
data; variances difference of the |n from the two populations, | (X, —X,)+Z,, 
G. and o>are —_—| population get the respective means 
rowecnaih means X,and X, and find the value 
populations are of X,—X>, an estimate of 
normally 
distributed Hi ~ Ma 


Quantitative [Ly — My, the Draw samples of sizesn, and | — | 1 1 
data; 67,0 are | difference of the |” from the two populations, | (X1 — X2)£tnin-2,4;95p4 [— +— 
not known but _ | Population compute X,—X,and Te he 


assumed equal; |™eans 
both populations 


. |Quantitative data; 
population normal 


o°, the population |Draw a sample of size n and 


+ ip) 
Vanhenive compute § 5} a) 
Xn-1a/2 Xn-1,1-a/2 


Table 6.10: Summary of confidence intervals (two populations) 


(n, -1)S? +(n, -1)S3 


are normally mt+n—2 
distributed 
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Nature of the Parameter Procedure Limits of confidence interval with 
population on which confidence coefficient 
confidence ae 
intervals is set 
3. | Quantitative Ly — Up, the Draw samples of sizes n, 
data; o7 and re of the |and n, compute X Seay Ok ,-X, ) + Z5/2,{— 
opulation 2 
©; eu nown He compute Sj and S 2 Confidence interval is 
and not assumed approximate 
equal; 
populations may 
not be normal; 
sample sizes m 
and n are large 
4. | Quantitative data | up = Wy — Wb Paired analysis calculates | — ce 
|, and by are the |Difference in —_| differences and uses one d bajar AB Sip 
means of means of two sample method for inference 
populations 1 normal on the mean difference <d+ tase Ba 
and 2 distributions ina ee 
respectively. paired analysis 
Standard 
deviation of 
differences are 
unknown 
5. | Quantitative o./0; Draw 2 samples, one from | Sy P Go; 
data; 6? and to of pi ieee compute 52 F-a2.n,-n-1 S o 
G, are the variances of two | 5; and S> > 
variances of noma St See 
population 1 and distribution S5 
2; o; and 
o; known 
Quantitative P1—P2, the Draw two samples, one Xi Te 


difference of the | from each population, of 


population 


data; p, is the 


ny 
proportion in 


sizes n, and ny; find Xa and 
: ny 

P2 in population 
2; sample sizes 
n, and nz are 
assumed large 


Xo , the estimates of p, and Zoi? 


7) 
P2 tespectively; then get 
Mi _ ke 

mh 


Confidence interval is based on 
the central limit theorem, thus is 
approximate 
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SAMPLE SIZE SELECTION 


5.12 


5.12.1 Sample Size Selection for Estimating Population Mean 


Each level is constructed with regard to a given confidence level and is called a confidence interval. The 
confidence level associated with a confidence interval states how much confidence we have that this interval 
contains the true population parameter. The confidence level is denoted by 100 (1 — &)%. When expressed 
as probability, it is called the confidence coefficient and is denoted by (1 — &). & is called the significance 
level. 

The confidence interval for for large samples is given as follows: 


The 100 (1 — «)% confidence interval for [1 is 


peas doe if 6 is known 
XZ if 6 is unknown (5.39) 
where 5, =ovn and s,=s/Vn 


The value of z used here is obtained from standard normal distribution table for the given confidence 
level. The quantity zo, (or zs, when o is not known) in the confidence interval formula is called the maximum 
error of estimate and is denoted by E. 


Thus, E=zs or E=20, 
The standard deviation of the sample mean is o/Vn 


0 


aaa (5.40) 


Given the confidence level and the standard deviation of the population, the sample size that will 
produce a predetermined error E of the confidence interval estimate of [ is 


¥, 1) 


i 


(5.41) 


where n = required sample size. 
We note that 


1. As the desired length of the interval 2E decreases, the required sample size n increases for a fixed 
value of o and specified confidence. 


2. As © increases, the required sample size n increases for a fixed desired length 2E and specified 
confidence. 


3. As the level of confidence increases, the required sample size n increases for fixed desired length 
2E and standard deviation o. 


Example E5.25 


At a 99% confidence level, the standard deviation 6 for the population is 0.7. How large a sample should be 
selected if we want the estimate to be within 0.01 of the population mean? 
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SOLUTION: 
The mean size is ¥+0.01 


Therefore, the maximum size of the error of the estimate is to be 0.01, that is E = 0.01. The value of z for a 
99% confidence level is 2.58. The value of o is given to be 0.7. Hence, substituting the values and simplifying 
Eq. (5.41), we get 


miSigue 58) (0); 2 
| (0.01) 


= 


32,616 


Hence, the required sample size is 32, 616. 


9.12.2 Sample Size for the Estimation of Proportion 


The maximum error, E, of the interval estimation of the population proportion is 


n 
The above expression can be manipulated algebraically to write it in terms of E, p, g and z and the final 
result is given here. Given the confidence level and the values of p and q, the sample size that will produce 
a predetermined maximum error E of the confidence interval estimate p is given by 


_= pq 
oe 

Another approach to choosing n uses the fact that the sample size from the above equation will always 
be a maximum for p = 0.5 (q = 0.5) or p(1 — p) $ 0.25 with equality for p = 0.5. This can be used to obtain an 


n 


upper bound on n. We are at least 100 (1 — «)% confident that the error in estimating p by pis less than E 


if the sample size is 


oy 


v4, 


n= (0.25) 


E 2 
Example E5.26 


(a) How large a sample should be selected so that the maximum error of estimated for a 99% confidence 
interval for p is 0.04 when the value of the sample proportion obtained from a preliminary sample 
is 0.2? 

(b) Find the most conservative sample size that will produce the maximum error for a 99% confidence 
interval for p equal to 0.04. 


SOLUTION: 
(a) Here E=0.04, p =0.2, Gg =1-02=08 
z = 2.58 for 99% confidence level (from the table in Appendix-F) 


2an 2 
Hence py ad ipe 298 (O20 8) ee 564 = 67 
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(b) E=0.04, z= 2.58, and p = g = 0.50 for most conservative sample size. 


z* pq _ 2.58°(0.5)(0.5) 
E° (0.04)? 


In this chapter, we introduced that part of statistics called inferential statistics. In Chapter 1, inferential 
statistics was described as the part of statistics that help us to make decisions about some characteristics 
of a population based on sample information. Inferential statistics makes use of the sample results to make 
decisions and draw conclusions about the population from which the sample is drawn. Estimation is the 
first topic to be covered in our presentation of inferential statistics. Estimation and hypothesis testing 
(presented in Chapter 6) topics taken together are usually referred to as making inferences. This chapter 
examined how to estimate the population mean, population proportion for a single population. We have 
extended our discussion of estimation to the difference between two population means and the difference 
between two population proportions. 


|PROBLEMS — 


PS.l A manufacturing company tested 25 printed circuit boards and the findings are as follows: Mean 
length of life for 25 boards = 3,566 hours. Standard deviation of life length in sample = 150 hours. 
Construct a 90% confidence interval for the mean length of life for the new printed circuit boards. 


Thus, n= = 1040.062 = 1040 


P5.2. ‘The mean tensile strength of a sample of 25 high quality steel specimens equals to 50,000 MPa. 
Find a 95% confidence interval on the mean tensile strength if the standard deviation is known to 
be SOO MPa. 

P5.3. (a) The standard deviation of a normally distribution manufacturing process is 3g. Determine 

the sample size for a 90% confidence interval so that the estimation of the mean process is 
within |g of the true but unknown mean yield. 


(b) An electrical light bulb manufacturing company likes to estimate the light bulb’s mean life. 
Assume a normal distribution of the life of a light bulb. Assuming the standard deviation is 
30 hours, find the how many bulbs should be tested so as to be: (i) 95% confident that the 
estimate x will not differ from the true mean life by more than 10 hours. (ii) 98% confident 
to accomplish (1). 


P5.4 =A college administrator wishes to estimate the mean age of all students currently enrolled. In a 
random sample of 20 students, the mean age is found to be 21 years. From past studies, the standard 


deviation is known to be 1.5 years. Construct a 90% confidence interval of the population mean 
age. 


PS:5 In order to estimate the amount of time (in minutes) that a teller spends on a customer, a bank 
manager decided to observe 64 customers picked at random. The amount of time the teller spent 
on each customer was recorded. It was found that a sample mean was 4 minutes will standard 
deviation 1.2 minutes. Find a 98% confidence interval for the mean amount of time UL. 


P5.6 = When 25 cigarettes of a particular brand were tested in a laboratory for the amount of tar content, 
it was found that their mean content was 20 milligrams with S = 2 milligrams. Set a 90% confidence 
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interval for the mean tar content u in the population of cigarettes of the brand. Assume that the 
amount of tar in a cigarette is normally distributed. 


The breaking strength of a machine part is required to be at least 75 MPa. Based on previous test 
data, the standard deviation of breaking strength is 6 MPa. A random sample of 16 specimens is 
tested and the average value is found to be 73 MPa. Construct a 95% confidence interval on the 
mean breaking strength. 


The diameters of copper shafts produced by a certain manufacturing process should have a mean 
diameter of 0.51 mm. The diameter is known to have a standard deviation of 0.0002 mm. A random 


sample 40 shafts has an average diameter of 0.509 mm. Construct a 95% confidence interval on the 
mean shaft diameter. 


A normally distributed random variable has a known variance of 6? = 36 and unknown mean. 
Construct a 95% confidence interval on the mean that has a total width of 2.0. 


In an experimental lab, 10 students are selected at random from a class and were assigned to set 
up an experiment. Their time to complete the experimental set up are recorded (in minutes) as 51, 
55, 54, 50, 56, 59, 57, 59, 58 and 60. Use a 95% confidence interval to estimate 1, the mean time for 
the student to set up the experiment. Assume that the population is normally distributed. 


A study claims that students in a particular college spend an average of 18 hours per week on 
leisure activities. The administration wanted to test this claim. A sample of 10 students selected at 
random gave their responses as follows (in hours): 


WG, 19,22; 23, 25, 26, 33, 38 and 41 


Assume that the time spent on leisure activities by all the students is normally distributed. Use a 
95% confidence interval activities. 


The time required to change flat tires in a garage is a normal distributed random variable. The tire 
changing times for 15 automobiles selected at random are as follows: 


Seon 710, 2421174, 13; 10; 9, 20; 8, 19,17 and 25 
Use a 95% confidence interval to estimate ft, the mean time required to change flat tires in that 
garage. 


A sample of 26 observations selected from a normally distributed population produced a sample 
variance of 35. Construct a confidence interval for o* for each of the following confidence levels. 


(a) 1-a=0.99 
(b) 1-a0=0.95 
(c) 1-a=0.90 


Refer to Problem 5.31. For the two steel specimen types in Prob. 5.31, determine a two-sided 95% 
confidence interval on the true variance. 


In a random sample of 30 machine turned parts, the sample standard deviation of the diameter of 
the machined parts was found to be 10 mm. Compute a 90% upper confidence interval on the 
variance of the diameter of the turned parts. 


The following data represent the amount of corrosion coating (in ounces) need to paint 6 plates: 


8.2, 8.8, 7.7, 7.9, 8.4 and 7.9 
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Determine the following: 

(a) a point estimate of 6’, the variance of the amount of corrosion coating needed to paint a 
plate 

(b) confidence intervals for o? and o with the following confidence levels, assuming that the 
amount of corrosion coating per plate is normally distributed. (i) 95%, (ii) 98%. 

Suppose 30 samples of an allergic medicine was randomly selected and weighed. The sample standard 

deviation is 1.3 milligrams. Assuming the weights are normally distributed, construct a 99% 

confidence intervals for the population variance and standard deviation. 


The following measurements (in kg) were obtained on the weights of 6 manufactured metal parts 
picked at random: 12.1, 8.1, 7.2, 12.3, 14.05 and 13. Determine the confidence intervals for o” and o 
with 95% confidence level. 


The manufacturer of a new line of automotive transmission is estimating the fraction of non- 
conforming units produced. Find a 95% two-sided confidence interval for the following: 

(a) 150 transmissions in the sample with 15 non-conforming 

(b) 1500 transmissions in the sample with 150 non-conforming 

(c) 15,000 transmissions in the sample with 1500 non-conforming 


With a random sample of 1000 people, 400 are women who turn up to see a baseball game. Find a 
95% confidence interval for the percentage of women at the game. 


A sample of 600 observations selected from a population produced a sample proportion equal to 
0.7. 


(a) make a 90% confidence interval for p. 

(b) construct a 95% confidence interval for p. 

(c) make a 99% confidence interval for p. 

(d) does the width of the confidence intervals constructed in part (a) through (c) increase as 
the confidence level increases? 


In a survey, 90% of drivers rated their driving as excellent or good. Suppose that this percentage 
was based on a random sample of 400 drivers, 

(a) what is the point estimate of this corresponding population proportion? 

(b) find a 95% confidence interval for the corresponding population proportion. 

A mail order company guarantees its customers that the products ordered will be mailed within 72 


hours after an order is receives. The quality control department took a sample of 50 orders and 

found that 40 of them were mailed within 72 hours of the placement of the orders. 

(a) construct a 98% confidence interval for the percentage of all orders that are mailed within 72 
hours of their placement 


(b) if the confidence interval found in part (a) is too wide, suggest a way to reduce the width of 
the interval. 


In a poll of 2000 adults, 60% of adults said that public education needs to be improved. Construct 
a 95% confidence interval for the proportion of all adults who hold this opinion. 


If a random sample of 60 non-smokers have a mean life of 75 years with a standard deviation of 
7 years, and a random sample of 50 smokers live 60 years with a standard deviation of 8 years, find 
a 95% confidence interval for the difference of mean life time of non-smokers and smokers. 
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that department. In a random sample of 100 students in Mechanical Engineering courses this year. 
80 received A or B; in a sample of 120 grades in Mechanical Engineering 5 years ago, 65 were A or 
B. Find a 90% confidence interval for the difference in the percentage of A and B grades now and 5 
years ago. 


Machine systems for a manufacturing company are supplied by two suppliers. Machines from 
supplier | have a standard deviation of 25 units and supplier 2 have a standard deviation of 
30 units. A random sample of 10 machines from supplier | have an average of 155 units and a 
similar sample of supplier 2 machines have an average of 137.4 units. Construct a 90% confidence 
interval on the true mean difference in the means of two machines. 


At the end of crash dieting program administered to 49 men and 36 women, the following information 
was obtained about the loss of weight (in kg). 


Mean loss of weight 


S, sample standard deviation 


Find an approximate 85% confidence interval for the difference in the mean losses. 


A consulting agency was asked by a large health insurance company to investigate if business 
majors were better sales persons. A sample of 40 sales persons with a business degree showed 
that they sold an average of 10 insurance policies per week with a standard deviations of 1.80 
policies. Another sample of 45 sales persons with a degree other than business showed that they 
sold an average of 8.5 insurance policies per week with a standard deviation of 1.35 policies. 
Construct a 99% confidence interval for the difference between the two population means. 

An administrator wants to test if the mean GPAs (grade point averages) of all male and female 
college students who actively participate in sports are different. A random sample of 28 male and 
24 female students who actively participate in sports was selected. The mean GPAs of the two 
groups are found to be 2.62 and 2.74 out of 4.0 respectively. with the corresponding standard 
deviations equal to 0.43 and 0.38. Assume that the GPAs of all male and female students have a 
normal distribution with the same standard deviation. Construct a 90% confidence interval for the 
difference between the two population means. 

Two slightly different steel specimen types of bar samples are being tested for tensile strength. 
Ten samples of each specimen type were selected one at random from ten different lots. The tensile 
tests on these bars resulted in 


ae 2 Mais, —2.2( Mra)", 
X, =25.5 MPaand s; =2.7 (MPa)’. 


Assume the tensile strengths are normally distributed. 
(a) find a 99% confidence interval on the difference in means 


(b) if only 5 samples were selected for the second specimen type, how does this affect the 


: = 2 s 
confidence interval? Assume the value X, of s; and remain unchanged. 
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The outputs of two manufacturing firms over one year period with random days picked gave the 
following data: 


mie X, =250 35 
n, =25 X, =200 s; =40 


Assuming the two outputs are normally distributed with the same variance, find a 95% two-sided 
confidence interval on the difference in their means. 


Random sample of weights were taken from two packages in a packaging process. The first sample 
consisted of n, = 150 and the second consisted of n, = 250 packages. The two sample means and 
standard deviations are as follows: 


X= 10k X, =9.8kg 
s, =0.25 kg , = 0.35 kg 
Determine the interval within which we can be 90% sure that 1, — 11, lies. 


Two diets, diet A and diet B, were fed to two groups of randomly picked cats of a certain bread. 


Diet A was fed to 10 cats. Their mean weight X, was 10 kg, and the sample standard deviation S, 


was 3 kg. Diet B was fed to 14 cats. Their mean weight X, was 12 kg and the sample standard 


deviation S, was 4 kg. Find a confidence interval for the difference in means with a 90% confidence 
level. Assume the weights of the cats given diet A and the weights of the cats given diet B are 
normally distributed with the same variance. 


Ten samples of standard cement had an average weight per cent calcium of X , = 80, with a sample 
standard deviation of S, = 5, while 15 samples of lead-doped cement has an average weight per 
cent calcium of X, = 77 with a sample standard deviation of S, = 4. Assume that the weight per 


cent calcium is normally distributed. Find a 95% confidence interval on the difference in means, 
Ll, — W,, for two types of cement. Further, assume that both normal populations have the same 
standard deviation. 


A new voltage-regulating device is installed in an electromechanical system. Before the installation, 
a random sample yielded the following information about the percentage of breakdowns of the 
system: X, =8, $7 = 36 and n, = 8. After installation, a random sample yields X, =7, $3 = 25 
and n, = 9. Find a confidence interval for the difference of the means with a 90% confidence level. 

Table P5.37 gives the systolic blood pressure of 7 adults before and after the completion of a 
special dietary plan based on a special dietary plan for 3 months. Construct a 95% confidence 


interval for t,. Assume that the population of paired differences, 1 p 1S (approximately) normally 
distributed. 


Table P5.37 
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A college claims that the Math tutoring service offers significantly increases the test scores of 


students in mathematics. The following table gives the scores per 120 of 8 students before and 
after they took the tutoring help. 


Before | 2 82 | 75 | 91 | 66 | 70| 91 | 69 | 

| After | 94 | 111 | 80 | 72 
Make a 95% confidence interval for the mean 1 , of the population paired differences where a paired 
difference is equal to the score before attending the tutoring service minus the score after attending 


the tutoring service. Assume that the population of paired differences is approximately normally 
distributed. 


A medical agency measured the corneal thickness of 8 patients who had glaucoma in one eye but 
not in the other. The following are the data on corneal thickness in microns. 


Patient | Normal | Glaucoma 


Assume that the population of paired differences is approximately normally distributed. Make a 
95% confidence interval for the mean 1, of the population difference, where a paired difference is 
equal to the difference in corneal thickness for normal and Glaucoma in microns. 


A company wanted to find out if attending a special course on “How to be a successful 
salesperson” can increase sales of its employees. The company sent 6 of its sales persons to 
attend this course. The following table gives the one-week sales of these salespersons before and 
after they attended this course. 


Make a 95% confidence interval for the mean 1, of the population paired differences where a paired 
difference is equal to the sales before and after attending the special course. Assume that the 
population of paired differences is approximately normally distributed. 


A company claims that its 12-week special exercise program significantly reduces weight. A random 
sample of 6 persons was selected, and these persons were put on this exercise program for |2 weeks. 
The following table gives the weight (in kg) of those 6 persons before and after the program. 


[ Before | 80 | 85 | 65 | 100 | 103 | 97 | 


Make a 95% confidence interval for the mean 1, of the population paired differences where a paired 
difference is equal to the weight before joining this exercise program minus the weight at the end 


of the 12-week program. 
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A manufacturing company sent 10 of its employees for a special training course to increase their 
productivity. The following table gives the one-week production of these employees before and 
after they attend this course. 


22 | 24 BEL | 19 | 20 i ii 


Make a 99% confidence interval for the mean 1, of the population paired differences where a paired 
difference is equal to the production before taking this special course minus the production after 
taking the course. Assume the population of paired differences having a normal distribution. 


The soldering machine has a supposed life of 6 years. Two machines are under consideration for 
acquisition by the manufacturer. Ten samples of machine A were subjected to simulated operating 


conditions, and the sample variance of machine life was computed as Oe = 4 years’. For machine 


BAS 3 = 8 years? for n, = 12. Determine a 95% two-sided confidence interval on the ratio of O* / On 
assuming that the machine lives are normally distributed. 


The length of steel bars manufactured on two different machines is being investigated. Two random 
samples of sizes n, = 16 and n, = 18 are selected and the sample means and sample variances are 


X, =10, S; =0.4, X, =9.6and $5 = 0.45, respectively. Assuming o; /o}5 , construct a 95% two- 


sided confidence interval on ratio of the population variance 6; /o5 . Is it reasonable to conclude 
that the two variances are equal? 

A study has been made to compare the aluminium contents of two brands of machine parts. Ten 
parts of brand A had an average aluminium content of 3 grams with a standard deviation of 


0.4 grams, while 8 parts of brand B had an average aluminium content of 0.3 grams with a standard 
deviation of 0.5 grams. Assuming that the two sets of data are independent random samples from 


normal populations, with equal variance find a 98% confidence interval for o; / o5 : 


Two different grinding processes are used to finish certain machine part. Both processes can 
produce parts at identical mean surface roughness. The company would like to select the process 
having the least variability in surface roughness. A random sample of n, = 11 parts from the first 
process results in a sample standard deviation S, = 5 micro millimeters, and a random sample of 
, = 16 parts from the second process results in a sample standard deviation of S, = 4 micro 
millimeters. Find a 90% confidence interval on the ratio of the two standard deviations, 0/0, 


A study has been conducted to study the capability of a gauge by measuring the weights of 2 
sheets of paper. The data are shown below: 


Paper 1 S, = 0.091159 n,=15 
Paper 2 S, = 0.084499 n,=15 
Use a = 0.05 and find a confidence interval. 


The diameter of brass rods manufactured on two different machines is being studied. Two random 
samples of sizes n, = 15 and n, = 17 are selected, and the sample means and sample variances are 
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My = OTS: Se = (013). ye = 7.65 and S = 0.4 respectively. Assume that the data are drawn from a 
normal distribution. Construct the following: 
(a) a90% two-sided confidence interval on 0/0, 


(b) a95% two-sided confidence interval on 6,/o,. Compare the width of this interval with the 
width of the interval in part (a). 


(c) a90% lower confidence bound on 0,/o, 


A survey was conducted involving 500 adult whites and 300 adult blacks 18 years or older. The 

following question was asked: “Do you think that police officers treat criminal suspects differently 

in low-income neighbourhoods than in middle- or high-income neighbourhoods?”. The survey 
resulted in 300 whites responding yes and 250 blacks responding yes. 

(a) find a point estimate of p, — p,, the difference in the proportions, where p, is the proportion 
among all white adults who would respond yes and p, is the proportion among all black 
adults who would respond yes 

(b) obtain an approximate 95% confidence interval for p, — p,. 


A company wanted to estimate the difference between the percentages of users of two electric 

shavers who will never switch to another brand. In a sample of 500 users of electric shaver A 

taken, 100 said that they will never switch to another brand. In another sample of 400 users of 

brand B taken, 70 said they will never switch to another brand. 

(a) find the point estimate of p, — p,, where p, and p, are the proportion of all users of brands A 
and B respectively. 

(b) construct a 97% confidence interval for the difference between the proportions of all users 
of the two brands who will never switch. 


In a random sample of 90 turbine blades, 10 have a surface finish that is rougher than the 
specifications allow. Suppose that a modification is made in the surface roughness finishing process 
and that, subsequently, a second random sample of 90 turbine blades is obtained. The number of 
defective blades in this second sample is 8. Find an approximate 95% confidence interval on the 
difference in the proportion of defective turbine blades produced under the two processes. 


In a random sample of 50 people from state A, 40 said they favoured death penalty and 24 out of 
48 from state B were in favour of death penalty. Find a 95% confidence interval for p, — p,, where 
p, is the proportion of those in state A favouring death penalty and p, is the proportion of those in 
state B favouring death penalty. 


An independent health agency investigated the health of independent random samples of white 

and African-American elderly (aged 70 or older). Of the 5989 elderly surveyed, 629 had at least one 

stroke, whereas 203 of 1006 African-American elderly surveyed reported at least one stroke. Find a 

95% confidence interval for the difference between the stroke incidences of white and African- 

American elderly. 

In a survey of 1000 drivers 25-34 years old, 27% said they buckle up, whereas 350 of 1000 drivers 

45-64 years old said that they did. Find a 95% confidence interval for the difference between the 

proportion of seat-belt users for drivers in the age group 25-34 years and 45-64 years. 

(a) The standard deviation of the lifetimes of a particular brand of light bulbs is 120 hours. How 
large a sample must be so that we are 95% confident that the error in the estimated mean life 
of the lights bulbs is less than 8 hours? 
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(b) If the standard deviation of the lifetimes of the light bulbs is not known, how large the samples 
must be tested to be 95% confident that the error in the estimated mean life of all the light 
bulbs is less than 0.16? 


A certain manufacturing firm that manufactures a certain type of electronic part wants to estimate 
its mean life. Assuming that the standard deviation o is 50 hours, find how many bulbs should be 
tested so as to be: 


(a) 95% confident that the estimate X will not differ from the true mean life 4 more than 10 hours 
(b) 98% confident to accomplish accuracy in part (a). 


Assume a normal distribution of the life of the electronic part. 


A government transportation agency wants to estimate at a 95% confidence level the mean speed 
for all automobiles travelling on a specific national highway. From a previous study, the agency 
knows that the standard deviation of speeds of automobiles travelling on this highway is 
6 kilometers per hour, what sample size should the agency choose for the estimate to be within 
2 kilometers per hour of the population mean? 


The school administration wants to determine a 95% confidence interval for the mean number of 
hours that the school students spend doing homework per week. It was known that the standard 
deviation for hours spent per week by all school students during homework is 10 hours. How 
large a sample should the school administration select so that the estimate will be within | hour of 
the population mean? 


The government would like to estimate the mean family size for all families in a particular state at a 
99% confidence level. It is known that the standard deviation o for all sizes of all families in that 
state is 0.7. How large a sample should the government select if it wants its estimate to be within 
1% of the population mean? 


(a) How large a sample should be selected so that the maximum error of estimate for a 99% 
confidence interval for p is 0.04 when the value of the sample proportion obtained from a 
preliminary sample is 0.6? 

(b) Find the most conservation sample size that will produce the maximum error for a 98% 
confidence interval for p equal to 0.04? 


A hardware store company guarantees all hardware deliveries within 30 minutes of the placement 
of orders on the plane. An agency wants to estimate the proportion of all hardware delivered within 
30 minutes by the company. What is the most conservative estimate of the sample size that would 
limit the maximum error to within 0.03 of the population proportion for a 99% confidence interval? 


A transportation safety agency wants to estimate the proportion of all drivers who wear seat belts 
while driving. Assume that a preliminary study has shown that 85% of drivers wear seat belts 
while driving. How large should the sample size be so that the 90% confidence interval for the 
population proportion has a maximum error of 0.02? 


(a) A company wants to estimate the proportion of parts made by a particular machine that are 
defective. The company wants this estimate to be within 0.01 of the population proportion 
for a 95% confidence level. What is the most conservative estimate of the sample size that 
will limit the maximum error to within 0.01 of the population proportion? 

(b) Refer to Problem P5.63(a). Suppose a preliminary sample of 300 parts produced by that 
machine showed that 6% of them are defective. How large a sample should the company 
select so that the 95% confidence interval for p is within 0.01 of the population proportion? 
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__ REVIEW QUESTIONS 


Explain the meaning of the following terms used in estimation. 


(a) estimator and estimate (b) point estimate and interval estimate 
What is the point estimator of the population mean, uw? Describe how the margin of error for a 
point estimate of is calculated? 

What are the various alternatives for decreasing the width of a confidence interval? 

Explain the following concepts: 

(a) How the width of confidence interval decreases with an increase in the sample size? 

(b) How the width of confidence interval decreases with a decrease in the confidence level? 
Explain the difference between a confidence level and confidence interval. 

What is the maximum error of estimate for u for a large sample? How is it obtained? 

What are the similarities and the differences between the standard normal distribution and the 
t-distribution? . 

Describe the parameters of a normal distribution and the ¢-distribution. 

Explain the meaning of the degrees of freedom for a ¢-distribution. 

What assumptions are made in using the f-distribution to make a confidence interval for LW? 

What assumptions are made in using the normal distribution to make a confidence interval for the 
proportion population, p? 

What is the point estimator of the population proportion, p? How ts the margin of error for a point 
estimate of p calculated? 

Describe the following terms: 

(a) estimation (b) estimator 


(c) maximum error of estimate (d) t-distribution 


STATE TRUE OR FALSE 


A point estimate of a parameter is the value of a statistic used to estimate the parameter. 


(True/False) 
Confidence interval is an interval of numbers obtained from a point estimate of a parameter. 
(True/False) 
Confidence level is the confidence we have that the parameter lies in the confidence interval (ex 
that the confidence interval contains the parameter). (True/False) 
For a fixed sample size, increasing the confidence level improves the precision, and vice versa. 
(True/False) 
A statistical procedure is said to be robust if it 1s sensitive to departures from the assumptions on 
which it is based. (True/False) 
The margin of error equals the standard error of the mean multiplied by <¢,,.. (True/False) 
The margin of error can be determined if we know only the confidence level. (True/False) 
(True/False) 


The confidence level can be determined if we know only the margin of error. 
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The margin of error can be determined if we know only the confidence level, population standard 


deviation, and sample size. (True/False) 
The confidence level can be determined if we know only the margin of error, population standard 
deviation and sample size. (True/False) 


For samples of size n, the variable t = ( —p)Ms/Vn) has the f-distribution with n degrees of 
freedom. (True/False) 
When the population standard deviation is unknown, the confidence interval procedure for a 
population mean cannot be based on the standardised version of x . (True/False) 


A confidence-interval estimate of a parameter consists of an interval of numbers obtained from a 
point estimate of the parameter and a percentage that specifies how confident we are that the 


parameter lies in the interval. (True/False) 
Independent samples means that the sample selected from one of the populations has no effect or 
bearing on the sample selected from the other population. (True/False) 
The pooled t-procedures require equal population standard deviation. (True/False) 
The non-pooled t-procedures do not require equal population standard deviations. (True/False) 
The shape of a normal distribution is determined by its standard deviation. (True/False) 
By using a paired sample, extraneous sources of variation cannot be removed. (True/False) 


For very small sample sizes, the normality assumption is essential for both t-procedures (pooled 
and non-pooled). (True/False) 


ANSWERS TO STATE TRUE OR FALSE - 


1. True 2. True 3. True 4. False 5. False 6. True 7. False 8. False 9. True 10. True 
11. False 12. True 13. True 14. True 15. True 16. True 17. True 18. False 19. True 


000 


CHAPTER 


Hypothesis Testing 


In Chapter 5, we examined methods for obtaining confidence intervals for means, variances and proportions. 
In this chapter, we present another type of statistical inference regarding these same parameters. 


Quite often, there is a need to make decisions about populations on the basis of sample information. 
We often use inferential statistics to make decisions or judgments about the value of a parameter, such as a 
population mean. For example, we might need to decide whether the mean life, 1, of all machine components 
(such as a shaft) manufactured by a particular company differs from the advertised life of 5 years or we 
might want to determine whether the mean age, w, of all automobiles has increased from the 1955 year mean 
of 8.5 years. In attempting to reach decisions or judgments, often it is necessary to make assumptions or 
guesses about the population involved. Such assumptions, which may or may not be true, are called 
statistical hypothesis and in general are statements about the probability distributions of the populations. 

Decisions made about populations on the basis of sample information are called statistical decisions. 
A hypothesis is an assertion made about a population. That is, a hypothesis is a statement that something 
is true. The assertion concerns a numerical value of some parameter of the population. Procedures which 
helps us to decide whether to accept or reject hypothesis or to determine whether observed samples differ 
significantly from expected results are called test of hypothesis, tests of significance or results of decision. 

Here, parameters are the statistical constants of the population (such as the mean (yu). variance (0°), 
etc.) and the statistical measures computed from the sample observations (data) are called statistic (such 


as the mean (x), variance s’, S’, etc.). 


In general, the sample size greater than or equal to 30 (1 2 30) is considered as large sample in statistics 
whereas the sample size less than 30 (n < 30) is regarded as small. Nearly all the distributions can be 


approximated by a normal distribution. 


6.1: NULL HYPOTHESIS AND ALTERNATIVE HYPOTHESIS 


A null hypothesis is a claim (or a statement) about a population parameter that is assumed to be true until it 
is declared false. A null hypothesis is a hypothesis to be tested. We use the symbol H,, to represent the null 


hypothesis. 
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An alternative hypothesis is a claim about a population parameter that will be true if the null hypothesis 
is false. It is a hypothesis to be considered as an alternative to the null hypothesis. We use the symbol H, 
to represent the alternative hypothesis. 


The problem in a hypothesis test is to decide whether the null hypothesis should be rejected in favour 
of the alternative hypothesis. To test a hypothesis, we devise a procedure for taking a random sample, 
computing an appropriate fest statistic and then rejecting or failing to reject the null hypothesis H,. Part of 
this procedure is specifying the set of values for the test statistic that leads to rejection of H,. This set of 
values is called the critical region or rejection region for the test. 

The procedure in setting up a hypothesis test is to decide on the null hypothesis and the alternative 
hypothesis. For example, for specifying hypothesis test for the population mean, , we write 

Null Hypothesis: H,: = 1, 
where |, is some number. 

Alternative Hypothesis: The selection of the alternative hypothesis depends greatly on and should reflect 
the objective of the hypothesis test. Three choices are available for the alternative hypothesis. For example, 
1. Whether a population mean, L. is different from a specified value 1,, we write the alternative 
hypothesis as 
Ay WU, 
A hypothesis test whose alternative hypothesis has the above form is called a two-tailed test. 


2. Whether a population mean, y, is /ess than a specified value U.,, we write the alternative hypothesis 
as 


Area 
| 0 
A hypothesis test whose alternative hypothesis has the above form is called a left-tailed test. 


3. Whether a population, [. is greater than a specified value u,, we express the alternative hypothesis 
as 


A:u>p 
i 0 
A hypothesis test whose alternative hypothesis has the above form is called a right-tailed test. 
A hypothesis test is called a one-tailed test if it is either left-tailed or right-tailed. 


If H, specifies the population completely, then it is known as a simple hypothesis otherwise, it is called 
composite hypothesis. 


For instance, 
Hy: w= i, O = c, is a simple hypothesis. 
On the other hand, 


. A 2 
H): B< Uy, 0 = 06 


H,: H> Hy 0 = 66 
SEA ea oe o = 
AS B=y, 
Hpot= a2 


are all composite hypothesis. 
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The basic logic of hypothesis testing is as follows: Select a random sample from the given population. 

If the sample data are consistent with the null hypothesis, do not reject the null hypothesis: if the sample 

data are inconsistent with the null hypothesis (in the direction of alternative hypothesis), reject the null 
hypothesis and conclude that the alternative hypothesis is true. 


A simple hypothesis completely determines the distribution of X. whereas a composite hypothesis does 
not. For instance, the simple and null hypothesis H,: 4 = 10 indicates that the distribution of X is exactly 
the same as it was before the change, whereas the composite alternative hypothesis H,: 4 > 10 is less 
specific. It just indicates that the new mean is greater than 10, but does not specify what exactly the new 
mean is. In general, the null hypothesis will be simple and the alternative hypothesis will be composite. 


6.2 THE CRITICAL REGION © 


All values of the test statistic in the direction of the alternative hypothesis with a test statistic value less 
than or equal to the significance level « define a set called the critical region of the test statistic. According 
to the definition of o, @ is the probability that the test statistic will lie in the critical region. 


Example E6.1a 


If H,: 1 =U, is true, then the test statistic is the standard normal random variable z. Assume the level of 
significance &. Determine the critical regions for one-sided and two-sided alternative hypothesis. 


SOLUTION: 
As shown in Fig. E6.1, for a two-tailed test, the rejection region is on both the left and right. The critical 


values are +z_,; for a left-tailed test, the critical value is —z ; and for a right-tailed test, the critical value is 


aes 
a/2? So 


z,- From the table in Appendix-E, using the standard normal table, we obtain the values of <, corresponding 


to each of those five tail areas as shown below: 
Z0.01 Z0),005 
sy PQ STS 
As an example: 


For H,: U< U,, the critical region is all values z <—1.28, since p(z $—1.28) = 0.10 
For H,: b> UL, the critical region is all values z= 1.28, since p(z 2 1.28) = 0.10 
For H,: . # Uy the critical region is all values z S —1.645 or z2 1.645, 
since p(z < —1.645) + p(z 2 1.645) = 0.10 or [p(z 2 1.645 = 0.10/2 = 0.05}. 
If the test value lies in the critical region, then the null hypothesis is rejected at the a level of significance. 
On the other hand, if the test value is not in the critical region, then the null hypothesis is not rejected. 


6.3. TYPES OF SAMPLING ERRORS (TYPE I AND TYPE II ERRORS) 


There are two important types of errors that can be made when arriving at a decision on the basis of a 
hypothesis test: They are 
1. Type I error: If we reject a hypothesis when it should be accepted. we call that a type / error has 
been made. Hence, a type I error occurs when a true null hypothesis isereyeeted. 
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The value of & represents the probability of committing this type of error. 

That is, the probability of this error is written as: & = P(Type I Error) = P(H,| H, is true). 
The value of o represents the significance level of the test. 

Type I error is also called a rejection error and Type I error is called acceptance error. 

2. Type Il error: A type II error occurs when a false null hypothesis is not rejected. The value of B 
represents the probability of making a type II error. It represents the probability that H, is not 
rejected when actually H, is false. The value of (1 — 6) is called the power of the test. It represents 
the probability of not making a type II error. 

That is, B = P (Type Il error) = P (fail to reject H, | H, is false). 
Power = 1 - B = P (reject H, | H, is false). 

The two types of errors that occur in test of hypothesis depend on each other. The values of o and B 
cannot be lowered simultaneously for a test of hypothesis for a fixed sample size. Lowering the value of 
one of them will raise the value of the other. On the other hand, one can decrease both a and B 
simultaneously by increasing the sample size. 

The Type I and Type II errors are defined in terms of probability numbers and can be controlled to 
desired values. The results possible in testing a hypothesis are summarised in Table 6.1. 


Table 6.1: Type | (a) error and Type Il (6) error 


If the decision on analysis is: | 


Accept Hp 
Do not reject Hp 


Reject Ho 


ke lel is 


False and A; is true 
Wrong decision 

P = B (probability) 
Correct decision 
P=1-—B (probability) 
2P = 1.0 


True 
Correct decision 
P= 1—-—«a (probability) 
Wrong or incorrect decision 
P= a (probability) 
Se = 0 


Summarising, we have that when a statistical test procedure based on sample data will lead to precisely 
one of the following four situations. Two of these situations will entail correct decisions and the other two, 
incorrect decisions. 

I. H, is true and H, is not rejected — a correct decision. 

2. H, is true and H, is rejected — an incorrect decision. 

3. H,) is false and H, is not rejected — an incorrect decision. 

4 


H, is false and H, is rejected ~ a correct decision. 


64 LEVEL OF SIGNIFICANCE : : . 


In testing a given hypothesis, the maximum probability with which one would be willing or prepared to risk 
a Type I error is called the level of significance of the test. This probability often denoted by & is generally 
specified before any samples are drawn, so that results obtained will not influence the choice made. 


The general procedure in hypothesis testing is to specify a value of the probability of type I error a, 
often called the significance level of the test, and then design the test procedure so that the probability of 
type II error B has a suitably small value. 
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6.5 TAILS OF A TEST . 


The rejection region for a hypothesis-testing problem can be on both sides with the non-rejection region in 
the middle, or it can be on the left side or on the right side of the non-rejection region. A test with two 
rejection regions is called a two-tailed test, and a test with one rejection region is called a one-tailed test. 
The one-tailed test is called a left-tailed test if the rejection regions is on the left tail of the distribution 
curve, and it is called a right-tailed test if the rejection region is in the right tail of the distribution curve. In 


such cases, the critical region is a region to one side of the distribution with area equal to the level of 
significance. 


Table 6.2 gives critical values of z for one-tailed and two-tailed tests at various levels of significance. If 
under a given hypothesis, the sampling distribution of a statistic S is a normal distribution with mean qt and 
standard deviation 6. Then, the distribution of a standardised variable (or z-score) is given by = = (S — w)/o. 
is the standardised normal distribution (mean 0, variance 1) as shown in Table 6.1. Table 6.3 summarises the 
relationships between the signs in H, and H, and the tails of a test. 


Table 6.2: Critical value of z 


Level of significance, z 
—2.33 | -2.58 | -2.88 
Critical value of z for one-tailed test or or 
2.58.| 2788 
-2.81 | -3.08 
Critical value of z for two-tailed test or or 
2.81 | 3.08 


Table 6.3: Summary of the relationships between the signs in 
H, and H, and the tails of a test 


Two-tailed test | Left-tailed test | Right-tailed test 


Sign in the null hypothesis Ho 


ze 
In the left tail 


Sign in the alternative hypothesis H, 


Rejection region In both tails In the right tail 


Critical region Critical region 


Fig. 6.1: The distribution of Z,, when H,: » = py, is true 
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For a fixed sample size, the smaller we specify the significance level, a, the larger will be the probability, 
8, of not rejecting a false null hypothesis. Suppose that a hypothesis test is conducted at a small significance 
level, then if the null hypothesis is rejected, we conclude that the alternative hypothesis is true. If the null 
hypothesis is not rejected, we conclude that the data do not provide sufficient evidence to support the 
alternative hypothesis. 

To test a hypothesis, we devise a procedure for taking a random sample, computing an appropriate fest 
statistic, and then rejecting or failing to reject the null hypothesis H,. Part of this procedure is specifying 
the set of values for the test statistic that leads to rejection of H,. This set of values is called the critical 


region or rejection region for the test. 

The steps involved in performing a test of hypothesis are listed below: 

1. State the null hypothesis. Under H, give a specific value of the parameter. 

2. State the alternative hypothesis H.. It specifies a range of possible values for the parameter that is 
being tested. It is important here to decide whether the test is one-tail or two-tail. Rejection of H, 
leads to acceptance of H.. 

3. Select an appropriate test statistic. The selection of the test statistic is determined by the 
conventional point estimator of the parameter in question. 

4. Stipulate the level of significance, @, the probability of rejecting H, wrongly. The critical points are 
determined by the value of a. Formulate the decision rule based on step 2, that is, determine the 
values of the test statistic that will lead to the rejection of H, (the critical region). 

5. ‘Take a random sample and compute the value of the test statistic. 

6. Make a decision based on the decision rule formulated in step 4. The choice is either reject H, or 
not reject it. Finally translate the conclusions into simple language for the benefit of the uninitiated. 


HYPOTHESIS TEST ON THE POPULATION MEAN, STANDARD 


DEVIATION KNOWN 


A basic assumptions about the population here is that it is normally distributed. In the absence of a normally 
distributed population, we will require that the sample size be large, that is, at least 30. The level of 
significance is then approximately 100 o per cent. 


Suppose, the random variable X represents the population of interest. We assume that the distribution 
ot X is either normal or that, if it is non-normal, the conditions of the central limit theorem hold. 


Consider that the mean of of X is unknown but that the variance o* is known. Suppose, we are 
interested in testing the hypothesis 
A; W= 
H: peu, 6.1) 
where |, is a specified constant. 


A random sample size of n, X,. X,, ..., X,, is available. Each observation in this sample has unknown 


a 


mean pt and known variance o°. The test procedure for H,: 1 = LU, uses the test statistic 


» am 


FT vege ee 
0 o/Vn (6.2) 
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If the null hypothesis i; =u, is true, then E(X) = u,, and it follows that the distribution OFZ, sey 


(0, 1). Consequently, if H,: w= L., 1s true, the probability is 1 — @ that a value of the test statistic Z, falls 
between —Z_,, and Ling ain Z,,, 18 the percentage point of the standard normal distribution snch that 


EZ) = w2 (re al is the 100 v2 percentage point of the standard normal distribution). The situation 
is illustrated in Fig. 6.2. 


' i] t 
; 1  Donot : 
Reject H, Be cicci He Reject H, Reject H, Do not reject H, Do not reject H, Reject H, 
1 ‘ i] 
' 1 i) ; 
! i] i] ' 
1 1 i] i 
1 t t ' 
1 i] i] ! 
1 if i] 
a/2 o/2 an 
z Zz 
-Zw2 QO Zo/2 —Zo 0 0 Zo 
(a) Two-tailed - (b) Left-tailed (c) Right-tailed 


Fig. 6.2 


Note that the probability is that a value of the test statistic Z, would fall in the region Z, > Z_, or 
Z, <—Z,,, when H,: 4 = U, is true. We should reject H, if either 


Ze. (63) 
or ZL), (6.4) 
and fail to reject H, if 

MEST Lire (6.5) 


Equation (6.5) defines the acceptance region for H, and Eqs. (6.4) and (6.5) defines the critical region 
or rejection region. The type II error probability for this test procedure is a. 

A summary of the test criteria to test H,: 4 = 1, against the three forms of alternative hypothesis 1s 
given in Table 6.4. 


Table 6.4: Procedure to test on the population mean, standard deviation known 
Assumptions: 
1. Simple random sample 


2. Normal population or large sample 
3. 6 known 


STEP 1: The null hypothesis is Hp: = Mo, and the alternative hypothesis is 


Ay: UL # Uo Fg:  < Wo Hy: W > Lo 
(Two-tailed) or (Left-tailed) or (Right-tailed) 
STEP 2: Decide on the significance level, a. 
STEP 3: Compute the value of the test statistic 
X ~ Uo 


a—— 
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STEP 4: The critical value(s) are 
+Zu —Zo Za 
(Two-tailed) or (Left-tailed) or (Right-tailed) 
Use the table in Appendix-E to find the critical value(s). 


; : ese 
RejectH, | Donot | RejectH) Reject Ho {| Do not reject Ho Do not reject Ho | Reject Ho 
| rejectHy |} H 

| J | 

| ! t 1 

{ I i} ! 

{ ij ( i) 

| I { i 

I i { i 

| J ro | a 

z z 

—Zon 0 Zor Za, 0 0 Zo. 

(a) Two-tailed (b) Left-tailed (c) Right-tailed 


STEP 5: If the value of the test statistic falls in the rejection region, reject Ho; otherwise, do not reject Ho. 
STEP 6: Interpret the results of the hypothesis test. 


The hypothesis test is exact for normal populations and is approximately correct for large samples from 
non-normal populations. 


Example E6.1b 


The tensile strength of steel bars produced by a manufacturer has a mean of 30 MPa and the standard 
deviation of 1.5 MPa. By a new technique in the manufacturing process, it is claimed that the tensile strength 
can be improved. To test this claim, a sample of 50 steel bars is tested and it is found that the mean tensile 
strength is 30.5 MPa. Can we support the claim at 0.01 level of significance? 


SOLUTION: 
A decision has to be made between the two hypotheses 
H,: 4 = 30 MPa and there is really no change in tensile strength 
H,: « > 30 Mpa and there is a change in tensile strength 
A one-tailed test should be used here. At a 0.01 level of significance, the decision rule is: 


I. If the z-score observed is greater than 2.33, the results are significant at the 0.01 level and H, is 
rejected. The area between 0 and z, in Fig. E6.1 is 0.4900, and z, = 2.33. 


2. Otherwise, H, is accepted (or the decision is withheld). Under the hypothesis that H, is true, we 
find 


X -uy Bier!) 
o/vn  1.5/V50 


which are greater than 2.33. The decision rule is: Reject H, if the computed value of the test statistic is 
greater than 2.33. Because the computed value 2.357 is greater than 2.33, we reject H,. Hence, we can conclude 
that the results are “highly significant” and therefore the claim should be supported. 


Zo = 


= 2.357 
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Critical region 


2.33 


2.357 (computed value) 


Fig. E6.1 


Example E6.2 — : 


The manufacturer of a certain brand an automobile component claims that mean life of these components is 
65 months. An independent agency wants to check this claim and took a random sample of 36 components 
and found that the mean life for this sample is 63.75 months with a standard deviation of 4 months. 


(a) Using the 2.5% significance level, can we conclude that the mean life of these components is less 
than 65 months? 
(b) Make the test a part of (a) using 5% significance level. Would the conclusion be different trom the 
one in part (a)? 
SOLUTION: 
Here, a decision has to be made between two hypotheses. 
H,: & = 65 months 
H,: w< 65 months 
This is a left-tailed test. 
(a) For a=0.025, the critical value of z is —1.96 (From the table in Appendix-E). 


The test statistic is 


7 = FIM _ 66.015-66 __ 9g 


The decision rule is: Reject H, if the computed value of the test statistic is less than -1.96. Because the 
computed value —1.88 is greater than —1.96, we do not reject H,. 

Hence, do not reject H, and we conclude that the mean life of these components is not less than 65 months 
(see Fig. E6.2(a)). 
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; Do not 
Reject Ho] reject H, 


Computed value—1.88 


Fig. E6.2(a) 
(b) For a=0.05, the critical value of z= —1.65 (from table in Appendix-E). 


Do not 


Reject Ho} reject H, 


Computed value—1.88 


Fig. E6.2(b) 
From Part (a), the value of the test statistic is —1.88. The decision rule is: Reject H, if the computed value of 
test statistic is less than —1.65. Because the computed value —1.88 is smaller than —1.65, we reject H,. Reject 


H,, and conclude that the mean life of these components is less than 65 months. The decision in parts 
(a) and (b) are quite different (see Fig. E6.2(b)). 


Example E6.3 


ABC company produces steel rods that are supposed to be 66 cm long. The mean length of the rods 
produced when the machine is working properly is 66 cm. The standard deviation of the length of all rods 
produced on this machine is 0.05 cm. A sample of 40 such steel rods was taken each week, calculated the 
mean length of these rods, and tested the null hypothesis against the alternative hypothesis using a 1% 
significance level. If the null hypothesis is rejected, the machine is stopped and adjusted. Recently a sample 
of 40 steel rods produced a mean length of 66.015 cm. Based on this sample, can we conclude that the 
machine needs an adjustment? 


SOLUTION: 
Here Hy: 4= 66cm 
A: 44 66cm 


This is a two-tailed test. 


For = 0.01, the critical values of z are —2.58 and 2.58 (From the table in Appendix-E). 
The test statistic: 


Dea 00.015—66 190 
“  olvn 0.05/40 
The decision rule is: Reject H, if the computed value is less than —2.58 or greater than 2.58. Because. the 


computed value is less than 2.58, we do not reject H,. Hence, do not reject H, and we conclude that the 
machine does not need to be adjusted (see Fig. E6.3). 


Reject H, Do not reject H, Reject H, 


Fig. E6.3 


6.7 HYPOTHESIS TEST ON THE POPULATION MEAN, STANDARD 


DEVIATION UNKNOWN 


In tests of hypothesis regarding the population mean, the situation where the population standard deviation 
is not known is more realistic. The procedure presented here is particularly important when the sample size 
is small. The following assumptions are made: 

1. The observations are independent. 

2. The parent population has a normal distribution. 

With the above assumptions, the statistic has students f-distribution with » — | degrees of freedom, if 
H, is true. 

Suppose that X ~ N(u, 6”) and wish to test H,: = -,, and we do not known p or 6°. This test procedure 
depends on X being normally distributed, although slight departures from normally are not series, especially 


when the sample size is large. A random sample of size n is taken, and X and S are computed. 
Now, to test the two-sided hypothesis 
Ay: W=y, 
A We, 
The test statistic is formed, 


(6.5a) 
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which follows the t-distribution with n — | degrees of freedom if H, is true. H, is rejected if ltol > fap. Where 
vy =n—1. One-sided alternatives can also be formed in a similar manner. 


A summary of the test procedure to test H,: W = W, against the three alternative hypothesis is given in 
Table 6.5. 


Table 6.5: Procedure to test on the population mean, standard deviation known 


Assumptions: 
1. Simple random sample 

2. Normal population or large sample 
3. 6 unknown 


STEP 1: The null hypothesis is Ho: = [o, and the alternative hypothesis is 
Ag: WF Mo Hy: UW < Uo Hg: [b> Uo 
(Two-tailed) or (Left-tailed) or (Right-tailed) 
STEP 2: Decide on the significance level, ©. 
STEP 3: Compute the value of the test statistic 
_ Ho 
s/n 
and denote that value fo. 
STEP 4: The critical value(s) are 
+ty2 —lo lo 
(Two-tailed) or (Left-tailed) or (Right-tailed) 
with df =n -—1. Use the table in Appendix-G to find the critical value(s). 


t 


! : ' 
Reject Ho Do not Reject H, RejectHo | 0 not reject Ho Do not reject Ho | Reject Ho 
1 fejectHo | ; 
: : ! 
I ' 1 i 
' i] 
\ 1 ! ! 
) { { : 
) { : ' 
1 \ : - 
| U a a 
t t 
eed 0 
(a) Two-tailed (b) Left-tailed (c) Right-tailed 


STEP 5: If the value of the test statistic falls in the rejection region, reject Ho; otherwise, do not reject Hp. 
STEP 6: Interpret the results of the hypothesis test. 


The hypothesis test 1s exact for normal populations and is approximately correct for large samples from 
non-normal populations. 


Example E6.4 


A manufacturing company sent ten of its employees for a special training course to increase their 


productivity. The following table gives the one-week production of these employees before and after they 
attend this course. 
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Before | 12 = 8 
After | 18 z Ll 
Using the 1% significance level, can a conclusion be made saying that “the mean weekly productions of all 


the employees increase as a result of attending this course”? Assume the populations of paired differences 
have a normal distribution. 


SOLUTION: 


Since the data are for paired samples, we can test the hypothesis about paired differences mean L, of 
population using the paired differences mean X of the sample. 


Let x = weekly output before the course — weekly output after the course. 


After 


es 


Let B= 4, — Hy 
Hy: ,=0 (UU, — H, = 0 or the mean output do not increase) 
A: u,<0 (u, — HW, < 0 or the mean output do increase) 
Since the sample size is small (< 30), the population of paired differences is normal, and p11, is unknown, we 
can use f-distribution to conduct the test. Degree of freedom =n -1= 10-1=9. 
The critical value of f is obtained from the table in Appendix-G, f,,, = 2.821. 


The test statistic f is 
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Since the test statistic t= 4.107 for X fall in the rejection region, we reject the null hypothesis. Therefore, 
we conclude that the mean weekly output for all employees increase as a result of the course. 


Reject H, Reject H 
0 


t-distribution 
10 degrees of 
freedom 


Computed value Critical value of t 


Fig. E6.4 


Example E6.5 


A manufacturer wants to determine whether the diameter of the steel rings gives satisfactory requirement. 
The particular ring will be considered satisfactory if the true diameter is greater than 20 cm. A field experiment 
was conducted in which 10 rings were tested at almost identical conditions and the mean diameter of the 
steel rings were computed for each. The results (in cm) were as follows: 


23,18,22 119.1 19.05, 22.1 18.25.15 0525-9 ameies ee 


Based on the data, is there sufficient evidence for the manufacturer to decide that the rings are satisfactory? 
The manufacturer is prepared to run a risk of Type I error of 0.05. It may be assumed that the diameter of 
the steel rings is normally distributed. 


SOLUTION: 

H,="=20 

H, => 20 
The standard deviation of the distribution is not known. We will have to estimate it from the sample. Hence, 
the test statistic is 


X¥=20 
Sin 


Here, & = 0.05, and since n = 10, we have 10 — 1 = 9 degrees of freedom. Thus, from the table for f-distribution 
(Appendix-G), we find f, ,,, = 1.833. The decision rule is: Reject H, if the computed value of the test statistic 
is greater than 1.833. It can easily verified from the data that 


=\2 
x= 205.4, X = 20.54, ae = 5.1643 
a 


Oo =5:1643 of y= 22795 
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Hence, the computed value of the test statistic is 


X-20  20,54-20 


= = | = 0.7514 (see Fig. E65 
sin 2.2725//10 a 


t-distribution 
9 df 


1.833 


0.7514 
Computed value 


Fig. E6.5 


Example E6.6 


A machine can adjusted so that when under control, the mean amount of fertilizer filled in a bag is 5 kg. To 
check if the machine is under control, 6 bags were picked at random and their weight (in kg) were found to 
be as follows: 

5.32, 5.19, 4.78, 5.18, 4.78 and 5.31 


At the 5% level of significance, is it true that the machine is not under control? (Assume a normal distribution 
for the weight of a bag). 


SOLUTION: 
Here o is not known and we will estimate it from the given sample. 
H,=p=5 
H,=p#5 
The test statistic is 
x-5 


SiNn 


We have a two-tailed test. Given & = 0.05 and n = 6,1, 45 = be o5 = 2-571 (from the table in Appendix-G). 
Hence, the decision rule is: Reject H, if the computed value is less than —2.571 or greater tham2.571': 


It can easily be verified from the given sample data that 


wer Oe 


n= 


= 5.09335 = 0.0623 and s = 0.2496 


Thus, the computed value of the test statistic is 


e270 Seas 


aie. es 00516 
siNn  0.2496/V6 
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Since the computed value is between —2.571 and 2.571, we do not reject H,. 


Conclusion: There is no significant evidence at the 5% significance level to indicate that the machine is not 
under control, so there is no need to adjust the machine (See Fig. E6.6). 


t-distribution 5 df 


0.025 


—2 571 0 t PAB 


0.9813 
Computed value 


Fig. E6.6 


6.8 HYPOTHESIS TEST FOR A POPULATION VARIANCE 


The following assumptions are made: 


1. The observations are independent. 


2. The parent population has a normal distribution. 


(184 


5 is chi- 
ae 


If the population is normally distributed with variance 6’, then the distribution of 


square with n — | degrees of freedom, where n represents the sample size. Hence, if H, is true, that is, if the 


(n-1)S? 
55 


population variance is o@, then the distribution of will be chi-square with n — | degrees of 


freedom. 


Here, we test the hypothesis that the variance of a normal distribution 0” equals a specified value of 


05 . Let X ~ M(u, 6°), where 4 and o° are unknown, and let X,, X,, ..., X, be a random sample of n observations 


from this population. To test 


Ho = Gi, 
H: 0 # 02 (66) 
We use the test statistic 
2 (n—1)S? 
Xo 0? (6.7) 


where the S? is the sample variance. Now if H,: or Ga is true, then the test statistic We follow the chi- 


square distribution with n — | degrees of freedom. Therefore, H_,: 6? = ee would be rejected if 
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Xo = Xo/2, n-1 (6.8) 
aS: 
Xo > Xi-a/2,n-1 (6.9) 
2 2 
where Xo/2.n-1 and Xj}_¢/2 n-1 are the upper and lower 0/2 percentage point of chi-square distribution with 


n — 1 degrees of freedom. 


For the one-sided hypothesis 


ey 2 
Hy. 8 =65 
ect =(G, (6.10) 
We would reject H, if 
Ce 
Xo > Xa, n-1 (6.11) 


For the other one-sided hypothesis 


ig ie o= ore 
H: 0? < 95 (6.12) 
We would reject H, if 
Faved ame (6.13) 


A summary of the test procedure to test H,: 0° = Ge against the three alternative hypothesis is given in 


Table 6.6. 


Table 6.6: Procedure to test for a population variance 


Assumptions: 
1. Simple random sample 
2. Normal population 


STEP 1: The null hypothesis is Ho: o= one , and the alternative hypothesis is 
HH: oO #0; lake Oo’ <0; i: G20, 
(Two-tailed) or (Left-tailed) or (Right-tailed) 

STEP 2: Decide on the significance level, a. 

STEP 3: Compute the value of the test statistic 


and denote that value % . 
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STEP 4: The critical value(s) are 


2 2 2 2 
Xi-a/2 and Xa/2 Xi-a Xa 
(Two-tailed) or (Left-tailed) or (Right-tailed) 
with df= n-— 1. Use the table in Appendix-F to find the critical value(s). 

Reject Hy ! Do not 1 RejectH, RejectHo | Do not reject Ho Do not reject Ho Reject Ho 
i rejectHo | I ! 
! ] 
! 

a2 Ls eeu a 1 
t 
! 
2 is 1G 
2 2 
Ki-or2 Here Xa Ko 
Two-tailed Left-tailed _ Right-tailed 


STEP 5: If the value of the test statistic falls in the rejection region, reject Ho; otherwise, do not reject Ho. 


STEP 6: Interpret the results of the hypothesis test. 


Example E6.7 


The copper content in an alloy material used to make a machine part is normally distributed, and the variance 
is 6° = 20. Test the hypothesis H,: 6° = 20 versus H,: 0° # 20, if a random sample of n = 10 machine parts 
yields a sample standard deviation of s = 4. Use a = 0.05. 


SOLUTION: 
o— 20, n= 10,5=4 
H,: 0? =20 
H,; & #20 
a= 0.05 
The test statistic is 


Nae = 2 
2 _(n=W)s* _ 10-14)" _ 5 4 


‘ 4 20 


The alternative hypothesis is two-sided so that the left is two-tailed. Now n = 10 and a = 0.05. 
Xn-1al2 = X5.0.025 


%n-11-0/2 = X5,097s [from the table in Appendix-F] 
6.05/2.n-1 = Xo.025,.9 = 19.022 [from the table in Appendix-F] 
he /2.n-1 = Kies = 2.70 [From the table in Appendix-F] 


2 oe 
X0.975,9 = Xo = Xo.025,9 
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Do not reject fi the data provide insufficient evidence to claim that the variance of the copper content of 
the machine parts is not 20 at a = 0.05. 


chi-square 
distribution 9 df 


0.025 


x 
0 2.70 7a 19.022 
Computed value 
Fig. E6.7 


Example E6.8 


The following measurements (in kg) were obtained on the weights of 6 manufactured metal parts picked at 
random: 12.1, 8.1, 7.2, 12.3, 14.05 and 13. At the 5% level of significance, is the population variance of the 
weight of such metal parts greater than 2.3? 


SOLUTION: 


H,.6= 2.3 =15166 
A: 0 > 1.5166 
The test statistic is 
(CE se 
23 


The alternative hypothesis is right-sided. Because n = 6, the chi-square has 6 — 1 = 5 degrees of freedom. 


From the table in Appendix-F, Vere = 11.07. Hence, the decision rule is: Reject H,, if the computed value of 


the test statistic exceeds 11.07. It can be verified that Xx = 66.75, X = 11.125 and S* = 7.7916. Therefore, the 
computed value of the test statistic is 


(n-1)S* _ (6-1)(7.7916) 
a 2.3 
Since the computed value 16.9382 is greater than 11.07, we reject H,. We conclude that at 5% level of 
significance, there is evidence indicating that the population variance exceeds pe, 


= 16.9382 (see Fig. E6.8) 


chi-square 
distribution 5 df 


0.05 


[6.9302 
Computed value 


Fig. E6.8 
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Example E6.9 


ee re en 


A restaurant claims that the standard deviation in the length of serving times is less than 3 minutes. A 
random sample of 23 serving times has a standard deviation of 2.3 minutes. At & = 0.10, is there enough 
evidence to support the restaurant’s claim? Assume the population is normally distributed. 


SOLUTION: 

The claim is “the standard deviation is less than 3 minutes”. 
H,; 6 23 minutes 

and H,: 6 <3 minutes 


The test statistic is 


(n=1)S? 
= 
Because the test is left-tailed test, the level of significance is @ = 0.10. There are 23 — 1 = 22 degrees of 
freedom and the critical value is 14.04 (from the table in Appendix-F). Therefore, the decision rule is: Reject 
H, if the computed value of the test statistic is less than 14.04. 
(n-1)S* _ (23-1)(2.3)° 


> 


= 12,9311 (see Fig. E6.9) 
Oo 


Because the computed value is less than 14.04, we reject foie 


Conclusion: There is enough evidence at the 10% level of significance to support the claim that the standard 
deviation for the lengths of serving times is less than 3 minutes. 


chi-square distribution 22 df 


Computed value 


Fig. E6.9 


6.9 HYPOTHESIS TEST ON A POPULATION PROPORTION 


Here, we treat the case of a qualitative variable, where the data are recorded as black-white, tall-short, 
successes-failures, defective-nondefective, etc. The purpose will be to test hypothesis regarding the 
proportion p of a certain attribute in the population. 


To carry out the test of hypothesis regarding the population proportion, we select a sample of 
observations and take the proportion of the attribute in the sample as statistic on which the test is based. 
We make the following assumptions: 
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1. The sample consists of n independent observations. 
The sample size is large. 


3. The hypothesised population proportion P, 18 not too close to 0 or | and is such that np, and 
n(1 — p,) are both greater than 5, 


If p is the proportion in the population, then the sample proportion si has a sampling distribution 
nh 


with mean p and standard deviation ./p(1— p)/n. 


If the sample size is large, both np and n(1 — p) are greater than 5, then the distribution X/n is 
approximately normal. If the null hypothesis is true, (population proportion, Py), then X/n will have a 


distribution that is approximately normal with mean p, and standard deviation Jpo(l— p)in . where np, 
and n (1 — p,) are assumed greater than S. 
Consider a binomial random-variable to be tested 
H,:P =P, 
H,: p' =P, 
If 0.1< p<0.9 and n large, the procedure described is acceptable. 


Pick a random sample of size n and compute p= X /n, where X represents the number of observations 


that are nonconfirming. If H, is true, then X ~ N[ po, Po(I1- po)/n]. To test p = p,, calculate 


7, = eee (6.14) 
Po(l— Po) 
i 


and reject H, if [Zo| > Zy;2. One-sided alternative could also be tested. 


The above test procedure is based on the application of the central limit theorem. Therefore, the 
significance level is approximately 100a per cent. 

Assuming of the test procedure to test H,: p # p, against the three alternative hypotheses is given in 
Table 6.7. 


Table 6.7: Procedure to test for a population proportion 


Assumptions: 
1. Simple random sample 
2. Both npo and n(1 — po) are 5 or greater 


STEP 1: The null hypothesis is Ho: p = po, and the alternative hypothesis is 
H,: p # Po H,: p < Po H,: p > Po 
(Two-tailed) or (Left-tailed) or (Right-tailed) 

STEP 2: Decide on the significance level, a. 
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STEP 3: Compute the value of the test statistic 
aa 
af Po(L— Po)/n 
and denote that value Zp. 
STEP 4: The critical value(s) are 
+Za/2 =<a, Zo. 
(Two-tailed) or (Left-tailed) or (Right-tailed) 
Use the table in Appendix-F to find the critical value(s). 


Reject Ho | Do not Reject Hy Reject Ho } Do not reject Ho Do not reject Ho Reject Ho 
| rejectHo | : 
! | | ' 
| | | | 
) - 
! i} | | 
o/2 } 1 a/2 a ! ! a 
z Zz z 
—Zoy/2 0 Zol2 —Zo 0 0 Za 
(a) Two-tailed (b) Left-tailed (c) Right-tailed 


STEP 5: If the value of the test statistic falls in the rejection region, reject Ho; otherwise, do not reject Ho. 
STEP 6: Interpret the results of the hypothesis test. 


Example E6.10 


An internet catalogue ordering company wants to deliver on time at least 90% all the orders received from 
its customers. A sample of 100 orders taken for inspection showed that 83 of them were delivered on time. 


(a) Using the 2% significant level, can we conclude the company’s delivery policy is maintained? 
(b) What will the decision be if the probability of making a type / error is zero? 


SOLUTION: 
(a) H,: p 20.90 
A: p< 0.90 
For & = 0.02, the critical value of Z is -2.05 (From the table in Appendix-E). 
E285 
= — = 0.83 
P* 100 
The test statistic is 
pD—- 0.83-0. 
Zs ee eee 5 333 


Jpqin  J(0.90)(0.10)/100 


Because & = 0.02, the decision rule is: Reject H, if the computed value is less than lig = —2505. 


The computed value —2.333 is less than —2.05. Reject H,. The conclusion is that the company’s 
policy is not maintained. 
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(b) If a =O, there is no rejection region. Hence, we cannot reject H, and cannot conclude that the 
company’s policy is maintained. 


Reject H,| Do not reject H, 


Computed values 


Fig. E6.10 


Example E6.11 


When a coin is tossed 198 times, it should heads 118 times. Is the coin biased in favour of heads? Conduct 
the test of hypothesis at the 5% level of significance. 


SOLUTION: 
Let p = probability of getting a head on a toss. Hence, we can state 
H,: p = 0.5 (really p < 0.5) 
HH: p>0.5 
Since p, = 0.5, the test statistic is 
P- Po 
V Po(l— po)/n 


a = 0.05, hence the decision rule is: Reject H, if the computed value is greater than 1.645. 


“0.05 


Since x = 198 and n = 198, the computed value of the test statistic is 


P- Po ~ 198 ___97 (see Fig. £6.11) 


VPoU-Ppo)/n  ¥(0.5)(0.5)/198 


The computed value 2.7 is greater than 1.645. Hence, we reject H,. We conclude that at the 5% level of 
significance, there is strong evidence that the coin is biased in favour of heads. 


Do not reject H,| Reject H, 


a = 0.05 


0 1.645 t 270 
Computed value 


Fig. E6.11 
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Example E6.12 


An independent survey found that 24% of the population are in favour of alcohol prohibition. You decide 
to test this claim and ask a random sample of 200 persons whether they are in favour of alcohol prohibition 
of the 200 persons, 28% are in favour. At & = 0.05, is there enough evidence to reject the claim? 


SOLUTION: 
H,: p = 0.24 (claim) 
H,: p #0.24 
Because p, = 0.24, the test statistic is 


P- Po 
V Po - Po)/n 


Because the test is a two-tailed test and the level of significance is a@ = 0.05, the critical values are 
~z, = -1.96 and z, = 1.96. Therefore, the decision rule is: Reject H, if the computed value is less than 
—1.96 or greater than 1.96. 
0.28 —0.24 
(0.24)(0.76) / 200 


N 
II 


= 1.324 (see Fig. 6.12) 


The computed value is between —1.96 and 1.96, and so we do not reject H,. 


Conclusion: At the 5% level of significance, there is not enough evidence to reject the claim that 24% of 
the population are in favour of alcohol prohibition. 


Do not reject H, 


Computed value 


Fig. E6.12 


6.10. HYPOTHESIS TEST ON EQUALITY OF TWO MEANS, VARIANCES 


KNOWN 


There are two normal populations (or two populations from which large samples are taken) with unknown 


means, say , and ,, and known variances Go: and O5 . To test the equality of the means, the following 
hypotheses are used: 

Hy; BW, =H, 

Ay: My # UL, 
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The test procedure consists of taking a random sample of size n, from the first population and 


compute X, . Similarly take a sample of size n, from the second population and compute X,. The test statistic 
is then given by 


X, =2G 

aaa (6.15) 
OO 
mM MN 


with critical region |Z, > Z.,.. 
The one-sided alternative hypotheses can also be used to test 
Hy: Wy = By 
A > ih, 
The test statistic given by Eq.(6.15) is computed, and H, is rejected if Z, > Z.. 
To test the other one-sided alternative hypotheses 
Hy: My =H, 
Ai, <p, 
The test statistic given by Eq.(6.15) is also used, but H, is rejected if Z, < —Z.,. 
The assumption of normal distribution of population is not important if sample sizes are large, that is, 


as a general rule, n, and n, are both at least 30. Also, here, even the population variances need not be 


known. The sample variance Ss and Ss can be used in place of respective variances G; and oO; , 


Table 6.8: Procedure to test equality of two means, variances known 


Assumptions: 

1. Simple random sample 

2. Independent samples 

3. Normal populations or large samples 


STEP 1: The null hypothesis is Ho: Hi = H2, and the alternative hypothesis is 


Agi Wi #2 Aa: ba < Ma Hy: by > Po 
(Two-tailed) or (Left-tailed) or (Right-tailed) 


STEP 2: Decide on the significance level, a. 


STEP 3: Compute the value of the test statistic 
x ia X 2 


_———————————— 
SoH meses ) 


Denote that value of the test statistic fo. 
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STEP 4: The critical value(s) are 


toy2 lo | lo 
(Two-tailed) or (Left-tailed) or (Right-tailed) 
with df = A, where 
a - USt mm) + (Sz mT" 
(Stim)? (Sz /my 
ia nN, -1 
rounded down to the nearest integer. Use the table in Appendix-G to find the critical value(s). 


’ 


Reject Ho | Donot | RejectHo Reject Ho | Do not reject Ho Do not reject Hp 1 Reject Ho 
i} ‘ ! ! 1 
; rejectHo | } i 
i 1 1 1 
i ! | 1 
i} | 1 ! 
i | ! i} 
1 1 1 1 
1 ! i] 1 
' 1 ! ! 
I | I i 
o/2 ! | o/2 a I \ ot 
t t t 
—tu2 0 twe —t 0 0 to 
(a) Two-tailed (b) Left-tailed (c) Right-tailed 


STEP 5: If the value of the test statistic falls in the rejection region, reject Hp; otherwise, do not reject Hp. 
STEP 6: Interpret the results of the hypothesis test. 


Example E6.13 


The following is summary statistics for independent simple random samples from 2 populations. Use the 
nonpooled t-test and the nonpooled t-interval procedure to conduct the required hypothesis test. 


Is there sufficient evidence to conclude that the mean in sample 1 exceeds that in sample 2? Assume 1% 
significance level. 


SOLUTION: 
Given X, =1158, X, =25,S,=79.4,S, = 10.5, n, =51,n, = 19. 
Ay BW, =p, 
AM, >, 
a=0.01 


The test statistic is 


LSB 25 


(79.4° /51) + (10.5° /19) 


pe 
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fs eats 


=. Se ee 


S)i [sf (may ceil 
mh Ny ot 
2) 6 en .: 


m—-1 | |n—1 (51-1) (19-1) 


The critical value = 2.397 (from the table in Appendix-G, after interpolation). 


The decision rule is: Reject H, if the computed value is greater than 2.397. Since the computed value 7.982 
is greater than 2.397, we reject H, (see Fig. E6.13). 


t-distribution 


Reject H, 


2.397 |7.982 
Computed value 


Fig. E6.13 


Hence, at 1% significance level, the data do not provide sufficient evidence to conclude the mean from 
sample 1 exceeds that in sample 2. 


The following is summary statistics for independent simple random samples from 2 populations. Use the 
nonpooled f-test and the nonpooled t-interval procedure to conduct the required hypothesis test. 


Si = L= 


At the 5% significance level, do the data provide sufficient evidence to conclude that the mean of sample 1 


smaller than sample 2? 


SOLUTION: 
A: U, =H, 
A: BW, <H, 
a= 0.05 


The test is a left-tailed test. 


The test statistic is 
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a 2 eS 
ny Ny 
a + —_ 


The critical value = —2.015 (from the table in Appendix-G). 


The decision rule is: Reject H, if the computed value of the test statistic is less than —2.015. Because the 
computed value —1.652 is greater than —2.015, we do not reject H, (see Fig. E6.14). 


At the 5% significance level, the data do not provide sufficient evidence to conclude that the mean is 
smaller with sample 1 then with sample 2. 


Reject H, |Do not reject H, 
t-distribution 


o/2 = 0.025 


—2.015 


-1.652 
Computed value 


Fig. E6.14 


Example E6.15 


The following is summary statistics for independent simple random samples from 2 populations. Use the 
nonpooled t-test procedure to conduct the required hypothesis test. 


At the 1% significance level, do the data provide sufficient evidence to conclude the means for the two 
samples are different? 


SOLUTION: 
Hy: W, =H, 
Ay, <p, 
a= 0.01 


/! Hypothesis Testing // 285 


——— a OHO ES LCSTEING 1 200 
The test statistic is 


esac econ 


and AlGo cs) ache] 


9\2 72 42 >\4 
Sass | 1.501 | 1.698? 
n, ny 14 14 
—— + a ws + 


Ri hl 14-1 14-] 


The critical value = +2.787 (from the table in Appendix-G). 


The alternative hypothesis is two-sided so that the test is two-tailed. The decision rule is: Reject H, if the 
computed value of the test statistic is less than —2.787 or greater than 2.787. Since 44.62 < —2.787, we reject 
H, (see Fig. E6.15). 


Conclusion: At the 1% significance level, the data provides sufficient evidence to conclude that the mean 
of the two samples are different. 


Reject H, —— Do not reject H, —— Reject H, 


t-distribution 


o/2 = 0.005 


4.62 | —2.787 0 2.787 
Computed value 


Fig. E6.15 


6.11 HYPOTHESIS TEST ON THE MEANS OF TWO NORMAL 
DISTRIBUTIONS, VARIANCES UNKNOWN 


We consider here the tests of hypotheses on the equality of the means p, and pL, of two normally 


distributions where the variances Gr and Oo; are unknown. A t-statistic is used to test the hypotheses. 


6.11.1 Case 1: 07 =03 =o" 


The following test procedures are particularly applicable for the case when small independent samples are 
drawn from normally distributed populations both having the same variance. 
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Let X, and X, be two independent normal populations with unknown means yw, and p,, and unknown 


. 4 9 . 
but equal variances, o7 = 0; =0° . We wish to test 


Fy By = Hy 
Ay: UM, #, (6.16) 
Suppose that X,,, X,,, ..-, Xi, is arandom sample of n, observations from X,, and X,,5/X55; --23 onan 


random sample of n, observations from X,,. 


Let X,, Xs and Se be the sample means and sample variances, respectively. Se and $3 estimate 


the common variance 6? and we can combine them to yield a single estimate 


2 _ (m=NS; + (m =1)S3 | (6.17) 
P Agee 2 


S 


To test H,: MW, = w, in Eq.(6.16), we compute the test statistic 


X LS X 2 
——— 
y Lae (6.18) 
nm Nn 
If H,: 1, = H, is true, f, is distributed as ¢,, , ,,-2- Therefore, if 
10 > 1ei2, mn, =2 (6.19) 
or if to <—ty/2,n +n, -2 (6.20) 
We reject Ai: W, = y 
The one-sided alternatives are treated similarly. To test 
AY, =H, 
AB, >, (6.21) 
compute the test statistic 7, in Eq.(6.18) and reject H,: 1, =p, if 
to > ban +n, -2 (6.22) 
For the other one-sided alternative, 
Ah: By = B, 
Ay: Uy <p, (6.23) 


calculate the test statistic 4, and reject A: , =p, if 


fo <—ba,m +n, -2 (6.24) 


2 ie summary of the test procedure to test H,: 4, = 1, against the three alternative hypotheses is given in 
able 6.9. 
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Table 6.9: Procedure to test on the means of two normal distributions, variances unknown 


Assumptions; | 
1. Simple random sample 
2. Independent samples 
3. Normal populations or large samples 
4. Equal population standard deviations 
STEP 1: The null hypothesis is Hp: 4; = Ma, and the alternative hypothesis is 
Ag: Wy # Wy Fg: Wy < Ue Aig: Wy > Wy 
(Two-tailed) or (Left-tailed) or (Right-tailed) 
STEP 2: Decide on the significance level, a. 


STEP 3: Compute the value of the test statistic 
x a, ve 


S/n.) + O/ny) 


lon =1)S/ + (nm, —1)82 
where S, Be i ad ol Th I dal ; 
nm +n, -—2 


Denote that value of the test statistic is 


STEP 4: The critical value(s) are 
Hoy? —ty bo 
(Two-tailed) or (Left-tailed) or (Right-tailed) 
with df= n, + mz — 2. 
Use the table in Appendix-G to find the critical value(s). 


Reject Hp | Donot | Reject Hp Reject HH, | D0 not reject Ho Do not rejectHo | Reject Ho 
| fejectHo | ! : 
: | 
H i] 
| I i ! 
4 ’ ! 
I | i 
' I i} : 
o/2 o/2 ot 
t t t 
tue 0 tye —te 0 0 ty 
(a) Two-tailed (b) Left-tailed (c) Right-tailed | 


STEP 5: If the value of the test statistic falls in the rejection region, reject Ho; otherwise, do not reject Ho. 
STEP 6: Interpret the results of the hypothesis test. 

The hypothesis test is exact for normal populations and is approximately correct for large samples from 
nonnormal populations. 
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Example E6.16 


The brands (A and B) of fertilizer were used by a company in an experiment. The following is summary 
statistics for the independent simple random samples from the 2 brands. 


Mean yield/plot | Standard deviation 
80 10 


88 


At the 5% significance level, are the two brands of fertilizers significantly different? It may be assumed that 
the yield per plot for the two brands of fertilizers are normally distributed with the same variance. 


Sample size 


SOLUTION: 
Ay: Wy =H, 
AM, AB, 
The test statistic is 
ie 


P 
mh Nh 


n, = 10,n,=15, X, =80, X, =88,S,=10, and S, = 20. 
The degrees of freedom are n, +n, — 2 = 10 + 15-2 = 23; = 0.05. Therefore ¢ = 2.069 (from the 


2), of — £30,005 
table in Appendix-G). The decision rule is: Reject H, if the computed value is less than —2.069 or greater 


than 2.069. 
(n, -1)S7 +(m -1)S3__ (10-1)(10)” + (15 -1)(20)? 


s = = 282.61 
sel a 10+15—-2 
or s, = 16.80 
The test statistic is 
80-88 


——_——— = -1.165 (see Fig. E6.16) 
16.8, = + a 
10 15 
Since the computed value —1.165 is between —2.069 and 2.069, we do not reject H,. 
We conclude that the data do not support the two brands are significantly different. 


Reject H, 4 Do not reject H, ——>i Reject H, 


a t-distribution 


o/2 = 0.025 J a= 0.025 


—2.069 0 2.069 
~1.165 


Computed value 


Fig. E6.16 
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Example E6.17 


=, mean life (in years) of certain machine part manufactured by two companies A and B are summarised 
Ow: 


Use the pooled t-procedure to conduct the required hypothesis test. At the 5% significance level, do the 
data provide sufficient evidence to conclude that the mean life of parts made by company A is less than 
that made by machine B? 


SOLUTION: 
Hy Wy =H, 
AU <1, 
a= 0.05 
The test statistic is 
xX,-X, 10-18.66 


cae Allee. 
Oise 4.772 el: 
nm Nn 10> 10 


where ea OS Is a0 me 
Why —2 LOO 2 
df=n,+n,—-2=10+10-2=18 
The critical value is —1.734 (from the table in Appendix-G). 
The decision rule is: Reject H, if the computed value of the test statistic is less than -1.734 (see Fig. E6.17). 
Because the computed value —-4.06 is less than —1.734, we reject H,. The data do not provide sufficient 
evidence to consider that, on the average, the mean life for parts made by company A is less than that made 


by Company B. 


Reject H, t-distribution 


Oj O08) 


=, = —1.734 0 


tate 
Computed value 


Fig. E6.17 
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Example E6.18 


I I ee 
In a particular company, the mean annual salary of male workers was $67,750 and that of female workers 
was $60,000 in 1999-2000. Assume that these two means are based on random sample of 28 male and 26 
female workers. The standard deviations for the two samples are $3.7 k and $3.2 k respectively. Using 1% 
significance level, can we conclude that the mean salary of all male workers in 1999-2000 was higher than 
that of all female workers? Further assume that the salaries of all male and female workers are both normally 
distributed with equal standard deviations. 


SOLUTION: 
Aon te—0 
eae 0 
o—O0l 


The alternative hypothesis is right-sided. The critical value of ¢ is 2.4 (from the table in Appendix-G after 
interpolation). 
_(n, -D)S; + (my - DSF _ (28-1)(3700)" + (26 - 13200)” 


p = 3468.6231 
Ait hy 2 254 20=2 


S 


X,-X - 
fie eet Mme CLE LU 


S, ae 3468.6231,]—— + 
m Mm 28 20 
The decision rule is: Reject H, if the computed value of the test statistic exceeds 2.4 (see Fig. E6.17). 


Because the computed value 8.205 is greater than 2.4, we reject H,. The mean salary of all male workers in 
1999-2000 was higher than that of all female workers. 


Do not reject H,| Reject H, 


t-distribution 


a=0.01 


0 2.4 13.2085 
Computed value 


Fig. E6.18 


6.11.2 Case 2: oj 40; 
In some situations, we cannot reasonably assume that the unknown variances on and 5 are equal. There 
is not an exact t-statistic available for testing A: UW, =H, in this case. However, the statistic 

i A 52 (6.25) 


m Nn 
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is distributed approximately as t with degrees of freedom given by 


See 

—+— 

ee 
we 


(Sp im)? (Sz! ny) (6.26) 
nm -1 nN -1 


if the null hypothesis H,: 1, = 1, is true. Therefore, if 67 # 03 , the hypothesis of Eqs. (6.16), (6.21), (6.23) 


are tested and to is used as the test statistic and n, + n, — 2 is replaced by n in determining the degrees of 


freedom in the test. 


Example £6.19 


A company claims that its high strength steel, type A provides higher tensile strength than another 
company’s steel, type B. A researcher tested both types of steel on two groups of randomly selected ones. 
The results of the test are given in the following table. The mean and standard deviation of tensile strength 
are in MPa. 


Sample size | Mean of tensile strengths | Standard deviation of tensile strength 


ve) MPa) 


(a) construct a 99% confidence interval for the difference between the mean tensile strength for the 
two types of steel. 
(b) test at a 1% significant level if the mean tensile strengths for type A is less than that for B. 


SOLUTION: 
S? Sz | 
nh MM 
(a) Degrees of freedom = 3 ante 
n ) re Ny 
pe i> — 1 


2 
E ie | 
=, + —— 
25 22 
3? ee 44 
| ae 
25 ZZ 
+ 
nem al 
The critical t-value (from the table in Appendix-G after interpolation), f, 9.4, = 2-692. 


dof. = 
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ae 13? Me 
X,-—X,)£t,{/—+— 
Hence, (X, 2) a5 


(40 — 45) + 2.692(3.5014) 
99% confidence interval for U1, — LL, is 
—5+9.43 or —14.42 to 4.4179 
Type A is between 14.42 MPa lesser and 4.42 MPa higher than type B. 


(b) A) 
Ae <0 
Degrees of freedom = 44 
too1.44 = ~2-414 (from the table in Appendix-G after interpolation). 
pHi =X) — Us -He) _ 40-45)-0 |, og 
Ss? sg 3.5014 
mn th 


Do not reject H,. The mean tensile strength for type A is not less than the mean tensile strength 
for type B. 


Reject H, | Do not reject H, eee 
t-distribution 


t 
t 0 
—1.428 
Computed value 


Fig. £6.19 


Example E6.20 


A manufacturing company is invested in buying one of two different types of machines. The company 
tested the two machines. The first machine was run for 8 hours and it produced an average of 129 items per 
hour with a standard deviation of 9 items. The second machine was run for 10 hours and it produced 120 
items per hour with a standard deviation of 6 items. Assume that the production per hour for each machine 


is approximately normally distributed. Further assume that the standard deviations of the hourly production 
of the two machines are unequal. 


Using the 2.5% significance level, can we conclude that the mean number of items produced per hour 
by the first machine is higher than that of the second machine? 
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SOLUTION: 
H.W, — 2, =0 
a — 0 
The appropriate statistic is 
Gene 
Se F Ss 
Nh Ny 
2 
nh Nn 
where ee ll 
oe aN 
n n 
te 
ny n, +1 
where S,=9, 5S, =6, n, = 8 and n, = 10. 


The alternative hypothesis is one-sided (right sided). Also, df= 11 and a = 0.025 so that the critical value of 
tis 2.201 (from the table in Appendix-G after interpolation). 


Hence, test statistic = _ = 2,429 


9 6 

— + — 

8 10 

Therefore, the decision rule is: Reject H, if the computed value of the test statistic is greater than 2.201. 
Because the computed value 2.429 is greater than 2.201, we reject H, (see Fig. E6.20). 

Hence, at the 2.5% significance level, there is evidence that the mean number of items produced per hour 
by the first machine is higher than that by the second machine. 


Do not reject H, | Reject H, 


t-distribution 


0 t, =-2.201 |2.429 
Computed value 


Fig. E6.20 
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Example E6.21 
A sample of 19 cans of Brand A soda gave the mean number of calories per can as 15 with a standard deviation 
of 1.920. Another sample of 16 cans of Brand B soda gave the mean number of calories 18.11 per can with a 
standard deviation of 2.250 calories. Assume that the calories per can of soda are normally distributed for each 
of these brands and that the standard deviations for the two populations are not equal. Test at the 5% 
significance level if the mean number of calories per can of soda are different for these two brands. 


SOLUTION: 


Let p, and pt, be the mean number of calories for all cans of soda of brand A and brand B, respectively. 
Given 
n, =8,n, = 10, S, =9 and S, =6. 


df= n Ny — 10 =99 
I | a a 
ny Ny 8 10 
+ + 


(™-D) (%-) @&-) d0o-) 


fe Fra Vyas S 0) 
jo eae w 
df=29 


Area to the right tail of the t-curve = 0.025. 
The test statistic is 
X,-X, _15-18.11 
2 2 2 2 
S; q S5 Ox ae 
nm MN 5 alld 


= — 4,353 


The alternative hypothesis is two-sided. The critical values of t are +2.045 (from the table in 
Appendix-G). Therefore, we have a two-tailed test. The decision rule is: Reject H, if the computed value is 
less than —2.045 or greater than 2.045. 


Because the computed value 4.353 falls in the rejection region, we reject H, (see Fig. E6.21). 


Reject H, — Do not reject H, —- Reject H, 


=0.02 ae ene 
ee a2 = 0.025 


4.353 


Computed value 


Fig. E6.21 


We conclude that there is difference in the mean number of calories per can for the two brands of soda 
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6.12. HYPOTHESIS TEST TO COMPARE TWO POPULATION MEANS 


(PAIRED t-TEST) 


The method of a paired t-test reduces the problem of comparing the means of two populations to that of a one- 
sample t-test. Here, we denote the difference of the Panes iby =.=. 


ihe n differences d,, d,, ..., d, constitute a sample of n independent observations from a population of 
within pair differences D”. We assume here that the differences D are normally distributed (especially when 
nis small). The population mean of paired differences is then equals to p., — W,. We are interested in testing 


H;; fies 
This is the same as testing the null hypothesis 
A: W, =0 
Now, we have a sample of n independent observations d,,d,, d,, ...,d, from a normal population whose 
variance is unknown, and we wish to test the hypothesis about its mean Us 
We can use the f-test with m — 1 degrees of freedom to test 
AL: Wu, = 0 
against the various alternative hypothesis. 


The test statistic is 


Ome dn 
syn Sq 


= Y7. 1 = 5 
where a] andes, =/—_ Ed, =) | 
n n-1 


The procedure for paired differences test of a hypothesis H,: 1, = 4, is summarised in Table 6.10. 


Table 6.10: Procedure to test to compare two population means (paired t-test) 


Assumptions: 
1. Simple random paired sample 
2. Normal differences or large sample 


STEP 1: The null hypothesis is Ho: [41 = b2, and the alternative hypothesis is | 
Hy: Wy F We Ay y= te Ag: Wy > 2 | 
(Two-tailed) or (Left-tailed) or (Right-tailed) 


STEP 2: Decide on the significance level, o. | 
STEP 3: Calculate the paired differences of the sample pairs. 
STEP 4: Compute the value of the test statistic 


oe d 
Sy IVn 
and denote that value to fo, also 


— kd; 1 = 
d =— and 54 = 4] —~ Xd; - 4)” 
n nt — a ds 
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— 
STEP 5: The critical value(s) are 
(Two-tailed) or (Left-tailed) or (Right-tailed) 
with df=n—1. 
Use the table in Appendix-G to find the critical value(s). 
Reject Ho Do not Reject Ho Reject Ho Do not reject Ho Do not reject Ho Reject Ho 
1 rejectHy  ! ' H 
I ‘ i H 
| 
I ' ' ! 
| 1 1 1 
] ! 1 ! 
I ' ' ! 
! ' ! t 
I ' 1 H 
a/2 a2 ¢ : a 
t t t 
—tue 0 tw to 0 0 ta 
(a) Two-tailed (b) Left-tailed (c) Right-tailed 
STEP 6: If the value of the test statistic falls in the rejection region, reject Hp; otherwise, do not reject Hp. 
STEP 7: Interpret the results of the hypothesis test. 
The hypothesis test is exact for normal differences and is approximately correct for large samples and 
nonnormal differences. 


Example E6.22 


Table E6.22 gives the systolic blood pressures of seven adults and after completion of a special dietary 
plan based on a special dietary plan for three months. Using the 5% significance level, can we conclude 
that the mean of the paired differences is different from zero? Assume that the population of paired differences 
is approximately normally distributed. 


SOLUTION: 


Difference, d 
119) 
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= 10.8430 


Ay: M, # M(H, = 0) 
Ai: 4,40 
The test statistic is 


de 2857 
sivn_ 10.8430./7 
d 


The alternative hypothesis is two-sided. Therefore, we have a two-tailed test. df=n-1=7-1=6and 
o/2 = 0.025. 

The two critical values of ¢ for df= 6 and @ = 0.025 are —2.447 and 2.447 (from the table in Appendix-G). 
The decision rule is: Reject H, if the computed value is less than —2.447 or greater than 2.447. 


1.2898 


Because the computed value 1.2898 for d falls in the non-rejection region as shown in Fig. E6.22 and 


we fail to reject the null hypothesis. 
Reject H, —— Do not reject H, —>i Reject H, 


t-distribution 


w2=0.025 j §  g/2 = 0.025 


Computed value 


Fig. E6.22 


A college claims that the tutoring service it offers significantly increases the test scores of students in 
mathematics. The following table gives the scores out of 120 of 8 students before and after they took the 


tutorial help. 


[Before | 82 [75 | 89[ 91 [66] 70] 91 | 69 | 
TAfter_| 97 [72 [94 [ 111 | 80 | 72 | 117 | 76 | 


% significance level, can you conclude that taking tutorial service increases the test scores of 
proximately normally distributed. 


Using the 5 ih : 
the students? Assume that the population of paired differences 1s ap 
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SOLUTION: 
n=8 


Difference, d 
82 97 15 


The population statistic is 


d _ -10.75 _ 
sywn 9.70648 


The alternative hypothesis is one-sided (right sided). Also a = 0.05. The critical value of t = —1.895 
(from the table in Appendix-G). 


=s lees 


Therefore, the decision rule is: Reject H, if the computed value of the test statistic is greater than —1.895. 


Because the computed statistic —3.1325 is greater than —1.895, we reject H, (see Fig. E6.23). We conclude 
that attending the tutoring service increases the mean test scores. 


Reject H, Do not reject H, 


t-distribution 


—3.1325 
Computed value 


Fig. E6.23 
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Example E6.24 


A medical agency measured the corneal thickness of 8 patients who had glaucoma in one eye but not in the 
other. The following are the data on corneal thickness, in microns. 


At the 10% significance level, do the data provide sufficient evidence to conclude the mean corneal thickness 
is greater in normal eyes than in eyes with glaucoma? 


SOLUTION: 


n=8 


Patient 


= 10.4847 


The test statistic is 


d 4.25 


pe es 
syvn  10.4847V8 

Hy: B, =, 

A: 4, > Hy 


a =0.10, df=7, critical value = 1.415 (from the table in Appendix-G). 


The alternative hypothesis is one-sided (right sided). Also, the decision rule is: Reject H, if the computed 
value of the test statistic is greater than 1.415. 
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Because the computed value 1.1465 is less than 1.415, do not reject H, (see Fig. E6.24). 


Hence, the data do not provide sufficient evidence at the 10% significance level that the mean corneal 
thickness is greater in normal eyes than in eyes with glaucoma. 


Do not reject H, | Reject H, 


t-distribution 


Computed value 


Fig. E6.24 


6.13 HYPOTHESIS TEST ON THE EQUALITY OF TWO VARIANCES 


The two independent populations are X, ~ M(u,, o; ) and X, ~ MLL, o5 ) where L.,, Gi, and U,, G. are 


unknown. We wish to test hypotheses about the equality of the two variances, say H,: Gi; = 65 . Assume 
that two random samples of size n, from population | and of size n, from population 2 are available, and let 


s? and s3 be the sample variances. To test the two-sided alternative 


H,: 6; =03 
H,: of #03 (6.27) 
The statistic 
SF 
fa 3 (6.28) 


is distributed as F, with n, — 1 and n, — | degrees of freedom, if the null hypothesis dd. G7 = 5 is true. 
Hence, we would reject H, if 


P= Tae (6.29) 
or if F rae errs es Mes (6.30) 
where Fo/2.n,-1.n,-1 and Fi_g/2, n-l.n,-1 are the upper and lower o/2 percentage points of the F-distribution 


with n, — 1 and n, — | degrees of freedom. Appendix-G gives only the upper tail points of F, so to find 
F\_«/2, n,-1,n,-1 We must use 


F - 1 
1-@/2,n,-1,%)-1 FT Go 7 
0/2, ~1,n,-1 


(6.31) 
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To test one-sided alternative hypothesis, let X, denotes the population that may have the largest 


variance, then the one-sided alternative hypothesis is 


ey ee 
H,: 0; =0} 


IO. > Os (6.32) 


If Ee ies 


: ee) 
We would reject H,: o; = oO; : 


A summary of the test procedure to test ite Oo; = 05 against the three alternative hypothesis is given 


in Table 6.11. 


Table 6.11: Procedure to test on the equality of two variances 


Assumptions: 
1. Simple random sample 
2. Independent samples 
3. Normal population 


STEP 1: The null hypothesis is Ho: 6; = 62, and the alternative hypothesis is 
H,: 0; #02 Hg: 0; < 02 Jel (On SS (oh 
(Two-tailed) or (Left-tailed) or (Right-tailed) 


STEP 2: Decide on the significance level, a. 
STEP 3: Compute the value of the test statistic 


and denote that value Fo. 
STEP 4: The critical value(s) are 
F\_q and Foy2 Fo. Fo 
(Two-tailed) or (Left-tailed) or (Right-tailed) 
with df = (n; — 1, n2 — 1). Use the table in Appendix-H to find the critical value(s). 


Reject Hp ; Donot reject | RejectHo RejectHp ; Do not reject Ho Do not reject Hp | Reject Ho 

| Ho ! ! ' 
' 1 { | 
! ! ' i 
i 1 ' ‘ 
! ) ! ( 
| | ! : 

a2 | | 1 a2 a |} tg 
' | 

ie [F = 
Fi-we Fue Paes “ 

Two-tailed Left-tailed Right-tailed 


STEP 5: If the value of the test statistic falls in the rejection region, reject Ho; otherwise, do not reject Ho. 


STEP 6: Interpret the results of the hypothesis test. 


(6.33) 


| 
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Example E6.25 


A company has two branch operations, A and B, producing a particular product. Branch A has 15 employees 
while branch B has 25 employees on any given month, although there was no significant difference in 
mean production, branch A had a standard deviation of 8 while branch B had a standard deviation of 10. 
Can we conclude at the (a) 0.01, (b) 0.05 level of significance that the variability of branch B is greater than 


that of branch A? 


SOLUTION: 
n= IS), n,= 25 
(a) We have, on using subscripts | and 2 for branches A and B respectively, S, = 8, S, = 10, so that 
Ary nh 5 15 


S? = 8") = 68.571 
is-p®? 


ny §2 25) 
aioe iS i) 


(107 ) = 104.167 


We have to decide between the hypotheses 

H,: 6, = 6,, and any observed variability is due to chance. 

H,: 6, > 6,, and the variability of branch B is greater than that of A. 

The decision must, therefore, be based on a one-tailed test of the F-distribution. Hence, 


_ $3 _ 104.167 
§? 68.571 
v,= 25-1] = 24; v,=15—1= 14. At 0.01 level for 24, 14 degrees of freedom, we have from 
Appendix-H, F,,, = 3.43. 

Then, since F < F,,, we cannot reject H,, at the 0.01 level. 


(b) F...=2.35 for 24, 14 degrees of freedom (see Appendix-H). 


0.95 


=1.519 


We must note again that F< F,,.. Thus, we cannot reject H, at the 0.05 level. 


Do not reject Ho Reject H, 


F=3. 
F= 1.519 ue 


Fig. E6.25 
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Example E6.26 


Patients who undergo chronic hemodialysis often experience severe anxiety. Videotapes of progressive 
relaxation exercises were shown to one group of 31 patients and neutral videotapes to another group of 25 
patients. A psychiatric questionnaire was used to measure anxiety where higher scores correspond to higher 
anxiety. The standard deviations from the anxiety measurement were 11.0403 and 10 respectively. Do the 
data provide sufficient evidence to conclude that variation in anxiety-test scores differs between patients 
who are shown videotapes of progressive relaxation exercises and those who are shown neutral viedeotapes? 
Perform an F-test at the 10% significance level. 


SOLUTION: 


n, =31 and n, = 25 


he 0, =0, 
1a CG 
O00) 


and test statistic 


_ Sv _ 11,0413? 


F 
Se 107 


= 1.2189 


| 
with df= (24930) = —- =063 


The critical values are F,,,, with df = (30, 24) or 1.94, and F,,. = 139 


0.05 
[see the table in Appendix-H]. 
The alternative hypothesis is two-sided. Therefore, we have a two-tailed test. The decision rule is: Reject 
H, if the computed value is less than 0.53 and greater than 1.94. 
Because the computed value 1.2189 is 0.53 < 1.2189 < 1.94, do not reject H, (see Fig. E6.26). 


Hence, there is not sufficient evidence at the 0.10 significance level to obtain that the variation in anxiety 
test scores for patients seeing videotapes showing progressive relaxation exercises 1s different from that in 


patients seeing neutral videotapes 


Reject Ho! Do not reject H, Reject H, 


F-distribution 


Fig. E6.26 
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Example E6.27 

An independent golf equipment testing facility compared the differences in the performance of golf balls 
hit off a regular wooden tee to those hit off a brand A golf tee. Tests were conducted and a robot swung 
the club head at approximately 150 kilometer per hour. Data on ball velocity (in kilometers per hour) with 


each type of tee are as follows: 


Brand A Regular 
X, =204 X, = 196 
S, = 0.368 5, =0.8 

n, = 30 n, = 30 


At the 1% significance level, do the data provide sufficient evidence to conclude that the standard 
deviation of velocity is less with Brand A tee than with the regular tee? 


SOLUTION: 


The test statistic is 


ey ; : 1 
The critical value is F,,, with df= (29, 29) or 242 = 0.41 [Note that F,,, = 2.42] from the table in 


Appendix-H. Because the alternative hypothesis is one-sided (on the left), the decision rule is: Reject H, if 
the computed value of the test statistic is less than 0.41 (see Fig. E6.27). 
Because the computed value 0.211 is < 0.41, we reject the null hypothesis H,. 

There is sufficient evidence at the 0.01 significance level to claim that the standard deviation of ball 
velocity is less with the Brand A tee than with the regular tee. 


Reject H, | Do not reject H, 


F-distribution 


a=0.01 


Fig. £6.27 
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6.14 HYPOTHESIS TEST ON TWO PROPORTIONS 


Quite often, a are required to decide whether the observed differences between two sample proportions is 
due to sampling error or due to the fact that the proportions in the parent populations from which the 


samples are drawn are inherently different. Let p, and p, represent the proportions of those infected in the 
two populations. 


The two binomial parameters are p, and p, and we with to test that they are equal. That is, H,: p, = p,. 

If the two random samples of sizes n, and n, are taken from two populations, and X, and X, represent 
the number of observations that belong to the class of interest in sample | and 2, respectively, then assume 
that the normal approximation to the binomial applies to each population, so that the estimators of the 
population proportions p, = X,/n, and p, = X,/n, have approximate normal distribution. Now, if the null 


hypothesis H_,: p, =p, is true, then using the fact that P, =P, = p, the random variable 


= We 
ag! 
(UY 20) pe (6.34) 
nh th 
is distributed approximately (0,1). An estimate of the common parameter p is 
Ny +N 
The test statistic for H,: p, = p, is then 
Zy = Pi — P2 
E ail 
PP) Gye) 
nh Nn 
If A228, 1 2.< 7, (6.36) 


the null hypothesis is rejected. 
A summary of the test procedure to test H,: p, = p, against the three alternative hypotheses is given in 
Table 6.12. 


Table 6.12: Procedure to test for comparison of two population proportions 


Assumptions: 
1. Simple random samples 
2. Independent samples 
3. x1, 21 —X}, and nz — x2 are all 5 or greater 
STEP 1: The null hypothesis is Ho: p; = p2, and the alternative hypothesis 1s 
Hz: pi # P2 H,: pi < p2 H,: pi > P2 
(Two-tailed) or (Left-tailed) or (Right-tailed) 


STEP 2: Decide on the significance level, &. 
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STEP 3: Compute the value of the test statistic 
= Py, — Pr 


~ Spd- p)Jfd/n, + Amn ) 


where Pp = (x; + x2)/(n; + m2). Denote the value of the test statistic Zo. 


STEP 4: The critical value(s) are 
+Z0/2 —a La 


(Two-tailed) or (Left-tailed) or (Right-tailed) 
Use the table in Appendix-E to find the critical value(s). 


Reject Ho Donot j{ RejectHo Reject Ho | Do not reject Ho Do not reject Ho Reject Ho 
1! rejectHp | 
| | ! | 
1 t ! i 
! 1 ' | 
! i] ) | 
! ! ' | 
| ' ! 4 
| ' ' | 
we | 2 a! 0 
z zZ z 
—Zal2 QO Zu Zo. 0 0 Zo. 
(a) Two-tailed (b) Left-tailed (c) Right-tailed 


STEP 5: If the value of the test statistic falls in the rejection region, reject Ho; otherwise, do not reject Ho. 


STEP 6: Interpret the results of the hypothesis test. 


Example E6.28 


In an independent study it was found that a sample of 500 male registered voters showed that 60% of them 
voted in the last general elections. Another sample of 400 female registered voters showed that 50% of 
them voted in the same election. Test at the 1% significance level if all male voters who voted in the last 


general election is different from that of all female voters. 


SOLUTION: 
A: p,—p,=9 
Hep =p 
P,=0.60, p, =0.50, n, = 500 and n, = 400 
For & = 0.01, the critical values of z are -2.58 and 2.58 (from the table in Appendix-E). 


oe 500(0.60) + 400(0.50) 
500 + 400 


i i 1 
—+—|= (osss6yo4asn| 1 x)- 
+4] J= ) 500 * 400 = 0.03333 


uv 


= 0.5556, p=1- p =1-0.5556 =0.4444 
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Hence z= {Pi Pa) (1= Pr) _ 0.60~-0.50 _, 
Si 0.03333 
enone 

nh Ny 


Hence, we reject H, and conclude that the proportion of all male voters who voted in the last general 
elections is different Con that of all female voters who voted in the same election (see Fig, EGi28): 


Reject H, — Do not reject H, er Reject H, 


~Zy2 '-2.58 0 2.58 | Z,)0 
Two-tailed 
Fig. E6.28 


Example E6.29 


A survey found that 44% male students and 31% female students in a particular college had more than one 
hour of physical exercise time per day. Suppose that these percentages are based on a random sample of 
395 male students and 410 female students. At the 1% significance level, can we conclude that the percentage 
of all male students and of all female students who have more than one hour of physical exercise time per 
day are different? 


SOLUTION: 
Here H,: p,— Pp, =9 
Hp. —p, +0 
For of = 0.01, the critical values of z are —2.58 and 2.58 (from the table in Appendix-£). 
p, = 0.44, p, =0.31, n, =395, and n, = 410 


_(r-py)-(i-=p2)___(.44-0.3)-0 gg, 
ae (0.374)(0. 626)( 51. : a 
395 410 


= be H+% _ 0.44(395) + 0.31(410) ~ 0.374 


ny +N 395+410 


Hence, z = 3.81 and we reject H, and conclude that the percentage of all male students and all female students 
who have more than one hour ae physical exercise time per day are different as shown in Fig. E6.29. 
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Reject H, aes Do not reject H, —> Reject H, 


=z 


a2 


—2.58 0 ZI5SEZe, 
Two-tailed 


Fig. E6.29 


Example E6.30 


An independent medical research team conducted a study to test the effect of a cholesterol reducing medication. 
At the end of study, the team found that of the 9400 subjects who took the medication, 602 died of heart 
disease. Of the 8600 persons who took brand A, 714 died of heart disease. At & = 0.01, can we conclude that 
the death rate is lower for those who took the medication than for those who took the brand A? 


SOLUTION: 
Hp 2p; 
H,: p, < p, (claim) 


Because the test is left-tailed and the level of significance is & = 0.01, the critical value is —2.33 (from the table 
in Appendix-E). The decision rule is: Reject H, if the computed value of the test statistic is less than —2.33. 


5 eee (gd 
n, 9400 
py) =2 = 14 _ 0083 
n, 8600 
ee + 
p= co) a ee OE a 7s 
m+n, 9400+ 8600 
g =1- p=1-0.073=0.927 
The test statistic is ; 
D, — p 0.064 — 0.08 
go ee Di Pe on ee eee —4,8989 
5 ae Ade 1 1 1 
J Ppd- p),/—+— coorsyo.s2n{ 5 + waa) 
ny 9400 8600 


Because the computed value —4.8989 is less than —2.33, we reject H, (see Fig. E6.30). At 1% level, there 


is enough evidence to conclude that the death rate is lower for those who took the medication than for 
those who took the brand A. 


/! Hypothesis Testing // 309 
A pothesis Testing // 309 


Reject H, {Do not reject H, 


z-distribution 


a =0.01 


4.8989 
Computed value < 


Fig. E6.30 


Example E6.31 


en 
A survey of 872 workers showed that 384 of them said that it was seriously unethical to monitor employee 
e-mail. When 242 senior-level managers were surveyed, 80 of them said that it was seriously unethical to 
monitor employee e-mail. Use a 0.05 significance level to test the claim that for those saying monitoring 
e-mail is seriously unethical, the proportion of employee is greater than the proportion of managers. 


SOLUTION: 
ee 404 
n 87 
NES GF 
ips Beat) 
atc: 
872+ 242 
lek P,-P,= 0 
pap, oO 
a= 0.05 


The critical values of z, = 1.645 (from the table in Appendix-E). 


The alternative hypothesis is one-sided (right-side). Therefore, the decision rule is Reject H, if the computed 
value of the test statistic is greater than 1.645. 


The test statistic is 


z= PD, = Py ee 0.4404 — 0.3306 Es 3.0654 
ee eae 
pil a pls _ + 0.4165(0.58 ( 872 242 


Because the computed value 3.0654 is greater than 1.645, we reject H, (see Pigs). 


_ Hence, there is sufficient evidence to conclude that p, — p, > 0. 
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Do not reject H, Reject H, 
z-distribution ! 


0 z,=1.645 43.0654 
Computed value 


Fig. £6.31 


Procedures for testing hypotheses on means, variances, and proportions for a single population and two 
populations are summarised in Tables 6.13 and 6.14 respectively. 


Table 6.13: Summary of tests of hypotheses (single population) 


No.| Nature of the population |Parameter} Null Alternative| Test statistic | Decision rule: Reject 
hypothesis | hypothesis the computed value 


1. | Quantitative data; UW iy 1. U> Uo . greater than Z,, 
variance oO” is known; 2G 2. less than —Z¢, 
population normally 3. WF Uo . less than —Zq/2 or 
distributed sreater than Zo) 


Quantitative data; 
variance O°” is not known; 
population normally 
distributed; sample size 
small (n < 30 
Quantitative data; 
variance O° is not known; 
population not necessarily 


. greater than t,-1,0 
2. less than — fr_1.¢ 
. less than — f,_1,0/2 OF 
greater than t,1,02 


. greater than 2, 
2. less than —Z,, 
3. less than —Zg2 or 


normal; sample large greater than Zo 
(n = 30) The level of 
significance 


approximately 100a 
per cent 


4. | Quantitative data; 
population normally 
distributed 


. greater than oar 


2. less than he ties 


3. less than Neale oe 
or greater than 


2 
Xn-1,0/2 


Quantitative data; 


| pp, . greater than Z,, 
binomial case 


fe 2. less than —Zy 

it 3. less than —z2 or 
greater than Z,2 

The level of 

significance 

approximately 100c 

per cent 
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Table 6.14: Summary of tests of hypotheses (two populations) 


No. | Nature of the 
population 


Test statistic 


Decision rule: 
Reject Hp if the 
computed value is 


Quantitative 
data; 
variances 
a; and or are 
known; both 
populations 
normally 
distributed 
Quantitative 
data; 07 , 05 
unknown; 
populations 
not 
necessarily 
normal; 
sample sizes 


: 


. greater than Z, 
. less than —zo, 
. less than —Zyp 


or greater than 
Zo/2 


. greater than Za 
2. less than —zq 
. less than —Zgn 


or greater than 


Za? 


are large 
Quantitative Ly — Wo . greater than 
data; o7 and tman-2.0 

5 . less than 
O; are not _ 

2 bnan2.0 
known, . less than 
assumed to be — fee OF 
equal; Both ary 2 2 Z h 

ha (S-ny oS (my) greater than 
populations ane = oa ee 1 Reese 
ll 1 2 
are normally 
distributed. 


Quantitative [Ly — Po . greater than 
data; o; and | ade 
05 are Not — tm+n2,.0 
known but are . less than 
assumed to be — tm+n-2.0/2 OT 

greater than 

(ny - 1S? + -)S3 tspeasare 
ny +n, —2 

distributed. 
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5. |Paired oleae =O | 0 avn 1. greater than 
observations (=Up) TCs 2. Up) 54 tn-1,0. 
Uy = Up 3. Up #0 a 2. less than 
where d = 2Xd,/n te 10 
i _, |3. less than 
and So = iy st d)” — t-1.a/2 OF 
ae greater than 
Ty-1,0/2 
6. |Quantitative G; 26 GC, =O, l- GC; >o7 . Ws Fo? Fg 2nieet 
; 2 ; . ae. i 5, 
data: Ly, 0; ? ‘O) O; 0, : S5 Fo <a Fo 2,m,-1hny-t 
U2 and > are 3, o ze 3 
unknown; Fo > Fain -tn,-1 
Both 
populations 
are normally 
distributed. 
7. | Qualitative Pi-p Pi=pP2 1. pi > p2 Pi - Po 1. greater than zq 


data; binomial 
case 


6.15 SUMMARY | 


2. Pi < p2 
3. Di # pr 


lew 

jap ds 

ny Ny 
ae + X> 
where j= 
ny + Ny 


Both n, and n> are assumed 


large 
nS X 
Pia aes 
ny 
A X> 


2. less than —Z¢, 

3. less than —zo,2 
or greater than 
Zo/2 

The level of 
significance 
approximately 
100a per cent 


a 


In this chapter, we introduced the second topic in inferential statistics: tests of hypotheses. In a test of 
hypothesis, we test a certain given theory or belief about a population parameter. Based on some sample 
proportions, we find out whether or not a given claim (or a statement) about a population parameter is true. 


We discussed how to make such tests of hypotheses about the population mean, u and population 
proportion, p, for a single population. The hypotheses-testing procedures have been extended to the 
difference between two population means and the difference between two population proportions. 
Constructing confidence intervals and testing hypotheses about population parameters are referred to as 


making inferences. 
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P6.1 


P6.2 


P6.3 


P6.4 


P6.5 


P6.6 


P6.7 


P6.8 


P6.9 


Determine the critical values for a hypothesis test at the 1% significance level (~ = 0.01) if the test 
is 


(a) two-tailed 

(b)  left-tailed 

(c) right-tailed 

Determine the critical values for a hypothesis test at the 5% significance level (ct = 0.05) if the test 
is 

(a) two-tailed 

(b)  left-tailed 

(c) right-tailed 

Determine the critical values for a hypothesis test at the 0.5% significance level (co = 0.005) if the 
test is : 

(a) two-tailed 

(b) left-tailed 

(c) right-tailed 

Determine the critical values for a hypothesis test at the 0.2% significance level (a = 0.002) if the 
test is 

(a) two-tailed 

(b)  left-tailed 

(c)  right-tailed 

The breaking strength of a machine part is required to be at least 75 Pa. Based on previous test 
data, the standard deviation of breaking strength is 6 Pa. A random sample of 16 specimens is 
tested and the average value is found to be 73 Pa. 

(a) state the hypotheses that need to be tested 

(b) test the hypotheses using & = 0.05. 

The diameter of copper shafts produced by a certain manufacturing company process should have 
a mean diameter of 0.51 mm. The diameter is known to have a standard deviation of 0.0002 mm. A 
random sample of 40 shafts has an average diameter of 0.509 mm. Test the hypotheses on the 
mean diameter using o = 0.05. 

Refer to Problem P6.7. Determine the sample size required to construct a 95% confidence interval 
on the mean that has a total width of 2.0. 

The mean life span of population of light bulbs manufactured by a particular company is found to 


be 2400 hours with a standard deviation of 250 hours. A sample of 100 light bulbs produced in a 
lot is found to have a mean life span of 2300 hours. Test whether the difference between the 


population and sample means is statistically significant. Assume & = 0.05. 


(a) nan experimental lab, 10 students are selected at random from a class and were assigned to 
set up an experiment. Their time to complete the experimental set up are recorded (in minutes) 
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is 51, 55, 54, 50, 56, 59, 57, 59, 58, 60. Would it be appropriate to suggest that the mean 
completion time for the given experimental set up in the student population as 58.2 minutes? 
Given a = 0.05. 


(b) The mean weight of all basketball players in a particular college is 69.3 kg. A random sample 
of 20 such players produced a mean weight of 70.0 kg with a standard deviation of 2) kee 
Assuming that the weights of all college basketball players are normally distributed, test at 
the 1% significance level if their mean weight is different from. 


A study claims that students in a particular college spend an average of 18 hours per week on 
leisure activities. The administration wanted to test this claim. A sample of 10 students selected at 
random gave their responses as follows (in hours): 


14, 16, 19, 22, 23, 25, 26, 33, 38, 41 


Assume that the time spend on leisure activities by all students is normally distributed. Using the 
5% significance level, can the administration conclude that the claim of the earlier study is true? 


A manufacturing company claims that the mean production time for their product is not more than 
45 minutes. A random sample of 20 products selected from the production line of the company 
showed that the mean production time for this sample is 50 minutes with a standard deviation of 3 
minutes. Using 1% significance level, can we conclude that the company’s claim is true? 


The mean height of 45 students who showed an above-average participation in college sports was 
68.1 inches with a standard deviation of 2.42 inches, while 45 other students who showed no interest 
in such sports participation and a mean height of 67.67 inches with a standard deviation of 
2.76 inches. 


(a) test the hypothesis that the students who participate in sports activities are taller that the 
other students. 
(b) by how much the sample size of each of the two groups be increased in order that the observed 


difference of 0.75 inches in the mean heights be significant at the level of significance 
(i) 0.05 and (ii) 0.01. 


A garage has a line of electronic tune-up systems. Another line has been installed and the manager 
would like to find out if the output of the new line is greater than that of the old one. Twelve days of 
data was gathered at random from the old line and 10 days of data are gathered at random from the 


new line, with X, = 1,150 cases and X, = 1,175 cases. Itis known that o7 =50 and 63 =60. Test 
the appropriate hypotheses at o& = 0.05 assuming that the outputs are normally distributed. 


A company tested two different machines in order to buy the best one out of the two machines. 
Machine | was run for 8 hrs and it produced an average of 120 parts per hour with a standard 
deviation of 9 parts. Machine 2 was run for 10 hrs and it produced an average of 111 parts per 
hour with a standard deviation of 6 parts. The production per hour of both machines is assumed 
as normally distributed. The standard deviation of the hourly productions of the two populations 
are unequal. Using a 2.5% significance level, can we conclude that the mean number of parts 
produced per hour by machine 1 is higher than that of machine 2? 


To compare two brands of machine parts, Brand A and Brand B, for their carbon content, a sample 
of 60 was inspected from Brand A and a sample of 40 from Brand B. The results of the inspection 
are given below: 
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At the 5% significance level, do the two brands differ in their mean carbon content? 


The time required to change flat tires in a garage is a normal distributed random variable. The tire 
changing time for 15 automobiles selected at random are as follows: 


SR ESS UY: (Cn 0° 
(a) at =0.05, does it seem reasonable that the mean tire changing time is 10 hours? 
(b) does it seem reasonable that the mean tire changing time is less than 10 hours? 


(c) does it seem reasonable that the mean tire changing time is greater than 10 hours’? 


The results of tensile tests on parts made of steel specimens are as follows. The load at which 


specimen failure in MPa are: the sample mean X = 12, and the sample standard deviation is s = 3. 
The numbers of samples are 25. Do the data suggest that the mean load at failure exceeds 10 MPa? 
Assume that the load at failure has normal distribution, and uses a = 0.05. 


The maximum baking temperature for a certain component is supposed to be 90°C. The oven 
temperature have a standard deviation of 10°C at the maximum baking temperature. A random sample 
of 228 readings is taken and the average baking temperature is found to be 94°C. Do the data 
suggest that the mean temperature at baking process exceeds 90°C? Assume that the baking process 
has a normal distribution and use a = 0.05. 


A company sent six of its sales employees to attend a special course in order to improve the sales 
of the company’s product. The one-week sales of these salespersons before and after attended 
the course are given below: 


| Before | 9 | 12 | 14 | 16 | 18 | 25. 
14 [18 | 19 | 20 
Using 1% significance level, can we conclude that the mean weekly sales for all salespersons 


increase as a result of attending the course? Assume the population of paired differences has a 
standard normal distribution. 


Table P6.20 gives the systolic blood pressure of seven adults before and after the completion of a 
special dietary plan for 3 months. 


Table P6.20 


[Before [ 210 | 180 | 195 | 220 | 231 | 199 | 224 | 

(O58 N85. |. 185, | 2230/9220 
Consider ., as the mean difference between systolic blood pressure before and after completing 
this special dietary plan of all adults. Using the 5% significance level, can we conclude that the 


mean of the paired difference uw, is different from zero? Assume that the population of paired 
difference is approximately normally distributed. 


A random sample of 6 automobiles was selected and these automobiles were driven for one week 
without the gasoline additive and then for one week with the gasoline additive. The manufacturer 
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of the gasoline additive claims that the use of this additive increases gasoline mileage. The following 
table gives miles per gallon for these automobiles without and with the gasoline additive. 


Using the 2.5% significance level, can you conclude that the use of the gasoline additive increases 
the gasoline mileage? Assume that the population of paired differences is approximately normally 
distributed. 


A manufacturing company was testing two machines for production automation purpose. The first 
machine was run for 8 hours and it produced an average of 136 items per hour with a standard 
deviation of 9 items. The second machine was run for 10 hours and produced an average of 127 
items per hour with a standard deviation of 6 items. Assume that the production rates of both 
machines are normally distributed. 

(a) make a 95% confidence interval for the difference between the population means 

(b) using a 2.5% significance level, can it be concluded that the mean number of items produced 

per hour by the first machine is higher than that of the second machine? 


Refer to problem 6.9. Assume that GO; and O5 are unknown and not assumed to be equal. However 


Ss = 65 and S3 = 75. Test the appropriate hypotheses at a = 0.05. 


Refer to Problem 6.3. Assume that 6; and 63 are unknown but assumed to be equal with sample 
variances as given. Test the appropriate hypotheses at a = 0.05. 


The annual salaries, in thousands of dollars, of 10 men in middle management at a given corporation 
are: 55, 65, 68, 70, 52, 56, 60, 72, 71, 73, while those for 8 women are: 56, 49, 58, 51, 61, 55, 57, 59. Let 
X and Y denote the salaries of men and women, respectively; assuming normal distribution and 
equal standard deviations, test H,: W, = 4, against H,; Ww, > w. at the 0.05 significance level by 
constructing a critical region for the tests. 


The following information was obtained from two independent samples selected from two normally 
distributed populations with unknown but equal standard deviation. 


| Sample! | 5] 8] 9] 9] 1ofiofifi2zfi2[i3[i3T | 
| Sample 2 | 11 | 12 | 13 | 14 [ 14 | 16 | 16 | 17 [17 [ 18 [ 19 [19 | 
Test at the 2.5% significance level if 1, is lower than L.,, where w, and w, are the means of 
populations 1 and 2 respectively. 


A consulting agency was asked by a large health insurance company to investigate if business 
majors were better salespersons. A sample of 40 salespersons with a business degree showed that 
they sold an average of 10 insurance policies per week with a standard deviation of 1.80 policies. 
Another sample of 45 salespersons with a degree other than business showed that they sold an 
average of 8.5 insurance policies per week with a standard deviation of 1.35 policies. At 1% 
significance level, can we conclude that the persons with a business degree are better salespersons 
than those who has a non-business degree? 
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The administration wants to test if the mean GPAs (grade point averages) of all male and female 
college students who actively participate in sports are different. A random sample of 28 male and 
24 female students who actively participate in sports was selected. The mean GPAs of the two 
groups are found to be 2.62 and 2.74 out of 4.0 respectively, with the corresponding standard 
deviation equal to 0.43 and 0.38. Assume that the GPAs of all male and female students have a 
normal distribution with the same standard deviation. Test at the 5% significance level if the mean 
GPAs of the two populations are different. 


The percentage of manganese in an alloy used in metal castings is measured in 50 randomly selected 
parts. The sample standard deviation is s = 0.4. Test the hypothesis H,: 6 = 0.25 versus H,: 6 # 0.25 
using o = 0.05. 


The thickness of a machine part used in a system is its critical dimension and that measurements 
of the thickness of a random sample of 18 such parts have a variance of s? = 0.6, where the 
measurements are in thousandths of an inch. The process is considered good if the variation of 
the thickness is given by a variance not greater than 0.3. Assuming that the measurements 
contribute a random sample from a normal population, test the null hypothesis o° = 0.3 against the 
alternative hypotheses 0” > 0.3 at the 0.05 level of significance. 


In a random sample, the weights of 24 machined parts of a certain process have a standard deviation 
of 3 grams. Assume that the weights constitute a random sample from a normal population. test 
the null hypothesis o = 4 grams against the two-sided alternative o # 4 grams at the 0.01 level of 
significance. 


The variance of scores on a standardised engineering test for all seniors was 150 in 2003. A sample 
of scores for 20 seniors who took this test in 2007 gave a variance of 170. Test at the 5% significance 
level if the variance of current seniors on this test is different from that of 2003. Assume that the 
scores of all seniors on this test are approximately normally distributed. 


The gaskets manufactured by a particular company must have a variance of 0.003 mm for acceptance 
by the customer. A random sample of 29 such gaskets gave a variance of 0.0058 mm. Test at the 
1% significance level if the variance of all such gaskets is greater than 0.003 mm. Assume that the 
thickness of all such gaskets produced by this company are approximately normally distributed. 


The manufacturer of a certain brand of fluorescent lights claims that the variance of the lives of 
these is 4000 hours. A consumer agency took a random sample of 25 such lights and tested them. 
The variance of the lives of these lights was found to be 4800 hours. Assume that the lives of all 
such lights are approximately normally distributed. Test at the 5% significance level if the variation 
of such light is different from 4000 hours. 


A random sample of 15 light bulbs produced by a certain company was selected and each bulb 
was tested for the number of hours before it burned out. If s = 13 hours at the 5% level of 
significance, is the population standard deviation different from 10 hours? 


A manufacturing company has two production lines that turn parts to a predetermined length. 
From a sample of 21 items taken from production line | and 30 items taken from production line 2. it 
was found that s, = 15 and s, = 12. Test the hypothesis that the two production lines have the 
same variance from the predetermined length. Take o = 0.05. 
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A new voltage-regulating device is installed in an electromechanical system. Before its installation, 
a random sample yielded the following information about the percentage of breakdown of the 


system: X, =8, s; = 36 and n, = 8. After installation, a random sample yielded X, =7, s? =—e 


and n, = 9. 

(a) can you conclude that the variance are equal? Use a = 0.05 

(b) has the new voltage-regulating device reduced the machine system breakdowns significantly? 
Use & = 0.05. 


In bolted joints using gaskets the variability in the thickness of the gasket material is a critical 
characteristic of the bolted joint and a low variability is desirable for better performance of the 
bolted joint. Two different materials are being considered to determine whether one is superior in 
reducing the variability of the thickness. Twenty gaskets are selected. The sample standard deviation 
of gasket thickness are s, = 2.96 mm and s, = 3.05 mm, respectively. Is there any evidence to indicate 
that either gasket material is preferable? Use o = 0.05. 


Two independent random samples are drawn from two normal populations. Test H, CF = G3 at 


5% level of significance. 


[Sample1 | io] s[is]i2] 9[i4[ie| of | ey 
[| Sample 2 ]..8 | 13 [21 | 15 | 11 | 15 | 18 | 9 | 21 | 22] 11 [16 | 


A random sample of six values of a random variable X is 24, 25, 28, 31, 32, 40 and independently 
obtained eight sample values of a random value Y are 12, 14, 15, 16, 16, 17, 17, 19. Assume that X 


and Y are normally distributed, test the null hypothesis ooo. =o7 against the alternative 


hypothesis H, : 0% > Ge at the 0.05 significance level by finding the critical region for the 


alternative hypothesis. 


Refer to Problem P6.40. Check if Hy :0% = oO; be rejected in favour of H,:02 > ce at the 0.01 


significance level. 


Two different manufacturing processes are used by a company to produce turbine blades with 
identical mean surface roughness. It is desirable to select the process having the least variability 
in surface roughness. A random sample of n, = 11 parts from the first process results in a sample 
standard deviation s, = 5.1 micro centimeters, and a random sample of n, = 16 parts from the second 
process results in a sample standard deviation of s, = 4.7 micro centimeters. Find at 90% confidence 
interval on the ratio of the two standard deviations, S/S5. 


A manufacturing firm promises a distributor that not more than 0.10 of those delivered toys will be 
blemished. A random sample of 500 toys delivered over the past 6 months yields 75 blemished 
toys. Should the toy manufacturer be declared in default at o = 0.05? 


In a survey of 1,000 random samples of machined parts, 700 were found to be non-defective. The 
manufacturing company wants at least 60% of all parts it produces are non-defective. Using a 0.05 
significance level, can we conclude that the company’s manufacturing policy is maintained? 
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A political analyst knows that a certain district voted 70% Republican in the last Congressional 
election. He also knows, however, that drastic changes in this district have occurred since the last 
election. He believes that these changes may have changed the percentage of votes that will vote 
for the Republican candidate for Congress, but doesn’t know which way it will be changed. He 
polls 2,000 voters in the district and finds that 960 of these plan to vote for the Democratic candidate. 
Is his belief supported, at 0.01 level of significance? 


A sample survey shows that 615 of 820 people surveyed prefer to live in single homes. Test the 


hypothesis that the true proportion of people who prefer to live in single homes is 0.70. 
Given 0=0.05. 


A machine is found to produce 32 per cent defective bolts. After repairing the machine, it was 
found it produced 24 defective bolts in the first production of 100. Is the true proportion of defective 
tubes reduced after the repairs? Use a = 0.01. 


In a certain city only 400 men out of 900 were found to drink alcohol. Determine whether this 
information supports the view that majority of men in that city are non-alcohol drinkers. (Use 5% 
level of significance). 


It was found that the proportion of families in the U.S. who took vacation of at least 1 week last 
year was 20 per cent. In order to find the attitude of families on traveling this year, 100 families 
selected at random and were interviewed and, of these, 15 said they would take such a vacation. 
Has the attitude changed from last year? Use a = 0.10. 


Two different polishing materials are being evaluated for use in machining operations in a 
manufacturing company. One hundred machine parts were polished using polishing material A and 
of this number 87 had no polishing-included defects. Another 100 machine parts were polished 
using polishing material B and 75 parts were satisfactory upon completion. Is there any reason to 
believe that the two polishing materials differ? Use a = 0.01. 


Two different types of grinding machines are used to finish metal parts. A part is considered 
defective if it has excessive surface roughness or variation. Two random samples, each of size 100 
are selected and 5 defective parts are found in the sample from machine 1, while 3 defective parts 
are found in the sample from machine 2. Is it reasonable to conclude that both machines produce 
the same fraction of defective parts, using a = 0.05? 


A sample of 500 observations taken from the first population gave X, = 400. Another sample of 
600 observations taken from the second population gave X, = 420. 

(a) find the point estimate of p, — p, 

(b) make a 97% confidence interval for p, — Pp, 


(c) show the rejection and non-rejection regions on the sampling distribution of p, — p, for 
H,: p, =p, versus H,: p, > p,. Use a significance level of 2.5% 

(d) find the value of the test statistic z for the test of part (c) 

(e) do you reject the null hypothesis mentioned in part (c) at a significance level of 2.5%? 


According to a particular research study, 24% of children in the age group 3 to 17 were overweight 
in 1994 and 31% were considered overweight in 2003. These percentages are based on random 
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samples of 400 and 500 children in the given age group in 1994 and 2003, respectively. Test at the 
1% significance level if the proportion of overweight children in the age group was less than in 
1994 than in 2003. 


A sample survey conducted in a city results show that out of 800 men 480 are employed whereas 
out of 600 women only 350 were employed. Can this difference between two proportions of employed 
persons be ascribed due to sampling fluctuations? Use a = 0.05. 


An insecticide of Brand A was sprayed into a jar containing 12 insects and it was found 95 of the 
insects were killed. When another jar containing 145 insecticides of the same type was sprayed 
with Brand B, 124 insects were killed. Do the two brands differ in their effectiveness at 2% level of 
significance? 


A study polled independent random samples of 747 men and 434 women in a particular city. Of 
those sampled, 276 men and 195 women responded that they frequently ordered pure vegetarian 
meals. Do the data poll provide sufficient evidence to conclude that the percentage of men who 
sometimes order pure vegetarian meals is smaller than the percentage of women who sometimes 
order vegetarian meals? Use 5% significance level. 


Duk wn 


GE 


REVIEW QUESTIONS 


Describe the meaning of each of the following terms: 


(a) null hypothesis (b) alternate hypothesis 
(c) critical points (d) significance level 
(e) rejection region (f) nonrejection region 
(g) tails of a test (h) two types of errors 


Describe briefly Type I and Type II errors. 

Describe the general steps involved in testing a hypothesis. 

Explain the difference between the critical value of z and observed value of z. 
What does the level of significance represent in a test of hypothesis? 


Describe the conditions that must hold true to use in ¢-distribution to make a test of hypothesis 
about the population mean. 


Explain when a sample is large enough to use the normal distribution to make a test of hypothesis 
about the population proportion. 


Describe the following terms: 


(a) Oo (b) B 
(c)  left-tailed test (d) right-tailed test 
(e) one-tailed test (f) test statistic 


Explain the meaning of independent and dependent samples. 


Describe the sampling distribution of x,— x, for large and independent samples. What are the 
mean and standard deviation of this sampling distribution? 


Describe the conditions that must hold true to use the ¢-distribution to make a confidence interval 


and to test a hypothesis about , —, for two independent samples selected from two populations 
with unknown but equal standard deviations. 
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12. Explain when the paired sample procedures are used to make confidence intervals and test 
hypothesis. 


13. Describe the shape of the sampling distribution of p,— p, for two large samples. What are the 
mean and standard deviation of this sampling distributions? 
14. Define the following terms: 


(a) paired or matched samples (b) w, 
(c) Ss, (d) Oo, 
STATE TRUE OR FALSE 
1. Null hypothesis is a hypothesis to be tested. (True/False) 
2. Alternative hypothesis is a hypothesis to be considered as an alternative to the null hypothesis. 
(True/False) 
3. The problems in a hypothesis test is to decide whether the null hypothesis should be accepted in 
favour of the alternative hypothesis. (True/False) 
4. A hypothesis test whose alternative hypothesis has the form HH: b #U, ts called a one-tailed test. 
(True/False) 
5. A hypothesis test whose alternative hypothesis has the form H,: 1 < 1, is called a right-tailed test. 
(True/False) 
6. A hypothesis test whose alternative hypothesis has the form H,: > pL, is called a right-tailed test. 
(True/False) 
7. A hypothesis is a statement that something is true. (True/False) 
8. The decision criterion provides an objective method for deciding whether the null hypothesis should 
be rejected in favour of the alternative hypothesis. (True/False) 
9. Test statistic is used as a basis for deciding whether the null hypothesis should be rejected. 
(True/False) 
10. Rejection region is the set of values for the test statistic that leads to acceptance of the null 
hypothesis. (True/False) 
11. Nonrejection region is the set of values for the test statistic that leads to non rejection of the null 
hypothesis. (True/False) 
12. Critical values are the values of the test statistic that separate the rejection and nonrejection regions. 
(True/False) 
13. A critical value is considered part of the nonrejection region. (True/False) 
14. Type I error is accepting the null hypothesis when it is in fact true. (True/False) 
15. Type Il error is not rejecting the null hypothesis when it is in fact false. (True/False ) 


16. The probability of making a Type I error, that is, of rejecting a true null hypothesis. is called the 
significance level, a, of a hypothesis test. (True/False) 
17. For a fixed-sample size, the smaller we specify the significance level, a. the large will be the 
probability, B, of not rejecting a false null hypothesis. (True/False) 


18. If a null hypothesis is not rejected, we conclude that the alternative hypothesis is true. 
(True/False) 
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i) 


20. 


Pale 


38. 
39. 


If the null hypothesis is not rejected, we conclude that the data do not provide sufficient evidence 


to support the alternative hypothesis. (True/False) 
If it is important not to reject to a true null hypothesis, the hypothesis test should be performed at 
a small significance level. (True/False) 
For a fixed sample size, decreasing the significance level of a hypothesis test results in an increase 
in the probability of making a Type II error. (True/False) 
Hypothesis tests and confidence intervals are not closely related. (True/False) 
The power of a hypothesis test is the probability of not making a Type IJ error, that is, the 
probability of rejecting a false null hypothesis. (True/False) 
Type I error is rejecting a true null hypothesis. (True/False) 
Type II error not rejecting a true null hypothesis. (True/False) 
Significance level is the probability of rejecting a true null hypothesis, that is, of making a Type I 
error. (True/False) 
In the context of hypothesis testing, & is the probability of making a Type II error (rejecting a true 
null hypothesis) also known as the significance level of the hypothesis test. (True/False) 
In the context of hypothesis testing B is the probability of making a Type II error (not rejecting a 
false null hypothesis). (True/False) 
In the context of hypothesis testing, (1 — B) is the power of the hypothesis test (the probability of 
not making a Type II error or equivalently) of rejecting a false null hypothesis. (True/False) 
For a fixed significance level, increasing the sample size decreases the power. (True/False) 


Decreasing the significance level of a hypothesis test without changing the sample size decreases 
the probability of a Type II error, equivalently, decreases the power. (True/False) 
The P-value of a hypothesis test equals the smallest significance level at which the null hypothesis 
can be rejected, that is, the smallest significance level for which the observed sample data results 


in rejecting of H,. (True/False) 
If the P-value is less than or equal to the specified significance level, reject the null hypothesis, 
otherwise, do not reject the null hypothesis. (True/False) 
Independent sample means that the sample selected from one of the populations has no effect or 
bearing on the sample selected from the other population. (True/False) 
Population proportion is the proportion (percentage) of the entire population that has the specified 
attribute. (True/False) 
Sample proportion is the proportion (percentage) of a sample from the population that has the 
specified attribute. (True/False) 
A population proportion is a parameter because it is a descriptive measure for a population. 
(True/False) 
A sample proportion is a statistic because it is a descriptive measure for a sample. (True/False) 
Number of successes is the number of members in the sample that have the specified attribute. 
(True/False) 


Number of failures is the number of members in the sample that have the specified attribute. 
(True/False) 
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4]. The mean of all possible sample proportions is equal to the population proportion. _(True/False) 
42. For large sample, the possible sample proportions have approximately a chi-square distribution. 
(True/False) 


43. A rule of thumb for using a normal distribution to approximate the distribution of all possible sample 
proportions is that both np, n(1 — p) are 5 or greater. (True/False) 


ANSWERS TO STATE TRUE OR FALSE 


1. True 2. True 3.False 4. False 5.False 6. True 7. True 8. True 9. True 10. False 
11. True 12. True 13. False 14. False 15. True 16. True 17. True 18. True 19. True 20. True 
21. True 22. False 23. True 24. True 25. False26. True 27. True 28. True 29. True 30. False 
31. False 32. True 33. True 34. True 35. True 36. True 37. True 38. True 39. True 40. False 
41. True 42. False 43. True 


a | 


CHAPTER 


Curve Fitting, 
Regression and Correlation 


In real life engineering practice, often a relationship is found to exist between two (or more) variables. For 
example: the experimental data for force (N) and velocity (m/s) from a wind tunnel experiment. A mechanical 
element/component is suspended in a wind tunnel and the force measured for various levels of wind velocity. 
This relationship can be visualised by plotting force versus velocity. It is frequently desirable to express 
this relationship in mathematical/analytical form by establishing an equation connecting the variables. 


In order to determine an equation connecting the variables, it is often necessary to collect the data 
depicting the values of the variables under consideration. 


For example, if x and y denote respectively the velocity and force from the wind tunnel experiment, then 
a sample of n individual would give the velocities x,, x,, ..., x, and the corresponding forces Volpe When 
these points (x,, y,), (,, Y,), ---» @,, y,) are plotted on a rectangular coordinate system, the resulting set of 
points on the plot is called the scatter diagram. From such a scatter diagram, one can visualise a smooth curve 
approximating the given data points. Such a curve is known as an approximating curve. Figure 7.1(a) shows 
that the data appears to be approximated by a straight line and it clearly exhibits a linear relationship between 
the two variables. On the other hand Fig.7.1(b) shows a relationship which is not linear and in fact it is a 
nonlinear relationship between the variables. Curve fitting is the general problem of finding equations of 
approximating curves which best fit the given set of data. 


y 


(a) Linear Relationship (b) Non-linear Relationship 


Fig. 7.1: Linear and non-linear relationship 
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Approximating Curves 


ee 7.1 lists a few common approximating curves used in practice and their equations. The variables x 
and y are called the independent and dependent variables respectively. The scatter diagrams of the variables 
or transformed variables will help determine the curve to be used. 


Table 7.1 
No. Equation Description of the curve 
yz=atbx : Straight line 
2. | y=a+bx+t+ Ox Parabola or quadratic curve 
liyaa + bx+ ox +dx Cubic curve 
A. | y=atbx+er +d +ex’ Quadratic curve 
S. | y=antaxt+ax +... + a,x" n" degree curve 
1 1 
6. | y= or —=c+mx 
. = Hyperbola 
y =ab* or log y= log a + x(log b) = ay + box Exponential curve 
sr y = ax’ or logy = log a + Dilog x) Geometric curve (power function) 
ca y= abe c Modified exponential curve 
y=ax’+e Modified geometric curve 
v= pq’ or logy =logp + b' log g = ab‘ + g | Gompertz curve 
y= pg’? +h Modified Gompertz curve 
| | , | 
ee eee Ol. Dt 9 Logistic curve 
Op ag y 
Veaoee or y= boy Exponential function 
1 
y= Reciprocal function 
mx+b 
16. | y=Qu = Saturation-growth-rate equation 
B+x 


Iisbale wr, 0, 0Nerd, €, AVAWH,, 043, 4, DID, h, m, a and b are all constant coefficients. 


Linear Regression 


Linear regression and correlation are two commonly used methods for examining the relationship between 
quantitative variables and for making predictions. In this chapter, we review linear equations with one 
independent variable, explain how to find the regression equation, the equation of the line that best fits 
a set of data points. We also examine the coefficient of determination that is a descriptive measure of 
the utility of the regression equation for making predictions. In addition, we discuss the linear correlation 
coefficient, which provides a descriptive measure of the strength of the linear relationship between the 
two quantitative variables. 
The inferential methods in regression and correlation are also covered in this chapter. 


7.1 LINEAREQUATION 


The general form of a linear equation with one independent variable can be written as 


y=a+h 


326 // Probability and Statistics for Scientists and Engineers /I 


where a and b are constants (fixed numbers), x is the independent variable, and y is the dependent variable. 
The graph of a linear equation with one independent variable is a straight line, or simply a line. Also, any 
non vertical line can be represented by such an equation. 

Linear equations with one independent variable occur frequently in applications of mathematics to 
many different fields, including the social sciences, engineering, and management as well as physical and 
mathematical sciences. 

For a linear equation y = a + bx, the number a is the y-value of the point of intersection of the line and 
the y-axis. The number b measures the steepness of the line. b indicates how much the y-value changes 
when the x-value increases by 1 unit. Figure 7.2(a) illustrates these relationships. 


b units up 


1 unit 
increase 


Fig. 7.2(a): Graph of y = a + bx 


The numbers a and b have special names that reflect these geometric interpretations. For a linear 
equation y = a + bx, the number a is called the y-intercept and the number b is called the slope. 


The graph of the linear equation y = a + bx slopes upward if b > 0, slopes downward if b < 0, and is 
horizontal if b = 0, as shown in Fig. 7.2(b). 


b>0 b<0O 


Fig. 7.2(b): Graphical interpretation of slope 


7.2 CURVE FITTING WITH A LINEAR EQUATION | : 


Curve fitting is a procedure in which a mathematical formula (equation) is used to best fit a given set of 
data points. The objective is to find a function that fits the data overall. Curve fitting is used when the 
values of the data points have some error, or scatter and require a curve fit to the points. Curve fitting can 
be accomplished with many types of functions and with polynomials of various orders. 
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Curve fitting using a linear equation (first degree polynomial) is the process by which an equation of 

the form 
y=a+bx (7.1) 
is used to best fit the given data points. This can be accomplished by finding the constants a and b that 
give the smallest error when the data points are substituted in Eq. (7.1). If the data points consists of onls 
two points, the constants can be obtained such that Eq. (7.1) gives the exact values at the points. 
Figure 7.3 shows the straight line corresponding to the Eq. (7.1) and passing through the two points. When 


the data has more than two points, the constants a and b are determined such that the line has the best fit 
overall as shown in Fig. 7.4. 


y a 


fe) x 


Fig. 7.3: Straight line connecting Fig. 7.4: A straight line passing 
two points through many data points 


The procedure for obtaining the constants a and b that give the best fit requires a definition of best fit 
and an analytical procedure for deriving the constants a and b. The fitting between the given data points 
and an approximating linear function is obtained by first computing the error. also called the residual. which 
is the difference between a data point and the value of the approximating function. at each point. Figure 7.5 
shows a linear function (straight line) that is used for curve fitting n points. 


(x;) 


Fig. 7.5: Curve-fitting points with a linear equation y = a + bx 
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Thus, the residual, e is the discrepancy between the true value of y and the approximating value, 
a + bx, predicted by the linear equation. 


7.3 CRITERIA FORA “BEST” FIT - 


A criterion that measures how well the approximating function fits the given data can be determined by 
computing a total error E in terms of the residuals as 


E= Ye, = Vly; - (at bx)] (7.2) 


i=l i=l 
where n = total number of points. 
However, this is an inadequate criterion, as illustrated in Fig.7.6, which shows that E is zero since 


é,=-€, and C= ex 


O 
Fig. 7.6: Straight line fit with E = 0 


One way to remove the effect of the signs might be to minimize the sum of the absolute values of the 
discrepancies: 


E=Ylel= >| y,-4-5y; | (7.3) 
i=l i=l 


Figure 7.7 shows why this criterion is also inadequate. For four points show, for the same set of points 


there can be several functions that give the same total error. E is the same for the two approximating lines 
in Figure 7.7. 


O x 


Fig. 7.7: Two straight line fits with the same total error as per Eq. (7.3) 
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A third strategy for fitting a best line is the minmax criterion. [n this technique, the straight line is 

chosen that minimizes the maximum distance that an individual point falls from the line. Again as shown in 
Fig. 7.8, this technique gives undue influence to an outlier (a single point with a large error). 


O 
Fig. 7.8: Minimization of the maximum error of any individual point 


A method that overcomes the shortcomings of the aforementioned approaches is to minimize the sum 
of the squares of the residuals: 


Se se = Ny (y; -a—bx;) (7.4) 
f=] i=] 


Equation (7.4) always gives a positive number of S_ (total error) and positive and negative residuals do 
not cancel each other. This criterion (Eq. 7.4) is called the least squares and has many advantages. including 
that it gives a unique line for a given set of data. 

Equation (7.4) can be used to determine the coefficients a and b in the linear function y = a + bx that 
yield the smallest total error. This is accomplished by using a procedure called linear least-squares regression, 
which is presented in the next section. 


74 LINEAR LEAST-SQUARES REGRESSION 


Linear least-squares regression is a method in which the coefficients a and b of a linear function y = a + bx 
are determined such that the function has the best fit to a given set of data points. The best fit is defined 
as the smallest possible total error that is computed by adding the squares of the residuals according to 
Eq. (7.4). 


For a given set of » data points (x, y,), the overall error calculated by Eq. (7.4) is 


S, = ¥by;-(at bx )P (75) 
i=l 
Since the values of x, and y, are known, S, in Eq. (7.5) is a non-linear function of two variables a and b. 
This function S has a minimum at the values of a and b where the partial derivatives of S| with respect to 
each variable is equal to zero. 
Taking the partial derivatives and setting then equal to zero gives 
os 


Wr = -29\(y, -a-bx,)=0 (76) 
da 2 p 
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Se = 29 f(y, -a- bx;)x,]=0 (7.7) 
i=] 


Equations (7.6) and (7.7) are a system to two linear equations for the two unknowns a and b, and can 
be rewritten in the form 


na+ b a} =D | (78) 
it | i=! 
Pa =p) a = y x0) (7.9) 
i=] | i=] 


Equations (7.8) and (7.9) are called the normal equations and can be solved simultaneously for 


Reon Be Ee 


b =I : (7.10) 
n> re [5 s| 
=! f= 
le 
3 xy; = f=1 i=] 
or b= a (7.11) 
ee i=l 
i=l 2 
Pia Pala Pad 
and a= ae G12) 


or a=y-bx 


Since Eqs. (7.10) to (7.12) contain summations that are the same for a set of n points, they can also be 
written as, 


SS, = 20, =%) =ix —(2 (7.13) 
SS, = X(x; - X)(y¥,-Y)= 2x; y; — (=x; Mzy,)/n (7.14) 
SS,, =Z(y; -¥)? = Zy? -(dy,)" /n (7.15) 


The regression equation for a set of n data points is 


pr=atbhx 
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where b= aes: 
SS. (7.16) 

and Ly ae 
a= a5 (2y; — b&x;) = y — bx (7.17) 


7.0 LINEAR REGRESSION ANALYSIS 


A regression model is a mathematical equation that describes the relationship between two or more variables. 
A single regression model includes only two variables: one independent and one dependent. The relationship 
between two variables in a regression analysis is expressed by a mathematical equation called a regression 
equation or model. A regression equation that gives a straight line relationship between two variables is 
called a linear regression model; otherwise, it is called a nonlinear regression model. Figure 7.9(a) and (b) 
show a linear and a nonlinear relationship between independent variable and the dependent variable. 


2 o 

ic 8 Non-linear 

& TS 

> g 

iE i ie 

5 Linear S 

12 no) 

cS [a 

ie)) o 

(Ou a. 

© (a) 

(a) Q 

O : O : 

Independent variable Independent variable 


(a) (b) 
Fig. 7.9: (a) Linear relationship, (b) Non-linear relationship 


The equation of a linear relationship between two variables x and y is written as 
y=at bx (7.17a) 
where a gives the y-intercept and b represents the slope of the line. 
In regression model, x is the independent variable and y is the dependent variable. The simple linear 
regression model for population is written as 
y=A+Bx (7.17b) 
Equation (7.17b) is called a deterministic model. It gives an exact relationship between x and y. However, 
in many instances the relationship between the variables is not exact. Therefore, the complete regression 
model is then written as 
yA+Bx+eE (7.17c) 
where € is called the random error term. This regression model (Eq. (7.5)) is called a probabilistic model 
(or a statistical relationship). The random error term € is included in the model to take into consideration 
of the following two phenomena: 
(a) Missing or omitted variables: The random error term € is included to capture the effect of all the 
missing or omitted variables that were not included in the model. 
(b) Random variation: The random error term € is included to capture the random variation. 
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In Eq. (7.17c), A and B are the population parameters. The regression line obtained from Eq. (7.17c) by 
using the population data is called the population regression line. The values of A and B is the population 
regression line is called the true values of the y-intercept and slope. 

However, most often the population data is difficult to obtain. As a consequence, we almost always 
use the sample data and use the model given by Eq. (7.17c). The values of the y-intercept and slope calculated 
from sample data on x and y are called the estimated values of A and B and are denoted by a and b. 


The estimated regression model is then written as 
Y=atbx (7.17d) 


where } is the estimated or predicted value of y for a given value of x. Equation (7.17d) is called the 


estimated regression model. It gives the regression of y on x. A plot of paired observation is called a scatter 
diagram as shown in Fig. 7.10. 


Fig. 7.10: Scatter diagram 


To find the line that best fits the scatter of points, we minimize the error sum of squares, denoted by 
SSE, which is given by 


SSE = Xe? = X(y-$) (7.17e) 
where e=y- jy. 


The least-squares method gives the values of a and b such that the sum of squared errors (SSE) is 
minimum. 


For the least-squares regression line ) =a + bx from Egs. (7.10) to (7.17), we have 


_ SS an 
= 35., and a= y —bx (7.18) 
where SS,, = Ixy - (2x29) 
. n 
2 
SSeh = ce cals 
nN 

SS, = Z(y— y) = Ly” -(Zy)? /n (7.19) 


The least-squares regression line = a + bx is also called the regression of y on x. 
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Example E7.1 


Table E7.1 gives experimental data for force (N) and velocity (m/s) for an object suspended in a wind tunnel. 


Table E7.1 


Velocity, v(m/s) 


(a) use the linear least-squares regression to determine the coefficients a and b in the function 


y =a+ bx that best fits the data 


(b) estimate the force when the velocity is 55 m/s. 


SOLUTION: 


Here n = 8. 


8 
Pe aa" sisi [aaa [319850 


From Eq. (7.16), we have 


SS 
Be 
(Zx)(Zy) _ _ (360)(5131) | 
where SSS) a ae 312830 sg = 81935 
2 _ (Ex)* (360)* _ 
and 55g =e — = 20400——— ee 


_ Sx _ 81935 _ 19 5083 
5S, 4200 


From Eq. (7.17), we have 
a= y—bx = 641.375 — (19.5083)(45) = —236.50 


Hence § =-236.50 + 19.5083x 


334 // Probability and Statistics for Scientists and Engineers // 


(b) The estimated value of the force when the velocity is 55 m/s is given by 
} =at bx =-236.50 + 19.5083(55) = 836.4583 N 


7.6 INTERPRETATION OF 4 AND b 


When b is positive, an increment in x will lead to an increase in y and a decrease in x will lead to a decrease 
in y. That is, when b is positive, the movements in x and y are in the same direction. Such a relationship 
between x and y is called a positive linear relationship. The regression line slopes upward from left to 
right. 

Similarly, if the value of b is negative, an increase in x will cause a decrease in y and a decrease in x will 
cause an increase in y. The changes in x and y are in opposite directions. Such a relationship between x and 
y is called a negative linear relationship. The regression line slopes downward from left to right. 
Figure 7.11 shows these two relationships. 


yi y, 

b>0 b>0 
O é O * 
(a) Positive linear relationship (b > 0) (b) Negative linear relationship (b < 0) 


Fig. 7.11: Positive and negative relationship between x and y 


Assumptions in the Regression Model 
The linear regression analysis is based on the following assumptions: 
1. The random error term € has a mean equal to zero for each x. 
The errors associated with different observations are independent. 


For any given x, the distribution of errors is normal. 


AWN 


The distribution of population errors for each x has the same (constant) standard deviation, which 


is denoted by 6, as shown in Figs. 7.12(a) and (b). 
Normal distribution 
with (constant) 
standard deviation o, 
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Population 
regression 
line 


(b) 
Fig. 7.12: (a) Assumptions in the regression model, (b) Distribution on the regression line 


7.7 _ STANDARD DEVIATION OF RANDOM ERRORS 


The standard deviation 6, measures the spread of the errors around the regression line as shown in 
Fig. 7.12(b). The standard deviation of errors is calculated using 


| SSE 
S, = 
n—-2 


where SSE= X(y- 5)? 
[SS,, —bSS,, 
or s. =4,———— (7.20) 
n-2 
where SS,, = Z(y—y)* = Zy? - yy 
; n 
zx)(Ly 
SS, = Lxy - ee) (7.21) 


n 
In Eq. (7.20), (n — 2) represents the degrees of freedom for the regression model. The reason for 


df = n — 2 is that we lose one degree of freedom to calculate x and one for y. 


7.8 COEFFICIENT OF DETERMINATION . : : 


The coefficient of determination, denoted by r’, represents the proportion of the foral sum of squares that 
is explained by the use of the regression model. The computational formula for 7 is given by 


SS 
ae O<r<l1 (7.22) 
Sig 


The total sum of squares denoted by SST is the total variation in the observed values of the response 


=p 


variable 
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2 zy) 
SST= 55, =e ey =" (7.23) 


which is the same formula we use for SS. 


The regression sum of squares, denoted by SSR, is the variation in the observed values of the response 
variable explained by the regression: 


SSE= X(y- 39) 
SSR = SST - SSE 
or SST = SSR + SSE (7.24) 
The ratio of SSR to SST gives the coefficient of determination. That is, 
2 SSR __SST-SSE_, SSE 


= = ae (emoaotw ; 
Sor Dol Sol Gay 


: 

This formula shows that we can also interpret the coefficient of determination as the percentage 
reduction obtained in the total squared error by using the regression equation instead of the mean, jy, to 
predict the observed values of the response variable. 


The coefficient of determination, denoted by r, represents the portion of SST that is explained by the 
use of the regression model. Hence, 


eo 
— - 
SS,y (7.26) 
and 0O<r<l 


The value of 7° is the proportion of the variation in y that is explained by the linear relationship between 
x and y. The coefficient of determination is the amount of the variation in y that is explained by the regression 
line. It is computed as 


2 _ explained variation _ 2(5- 9)" 
total variation X(y- y) 
The coefficient of determination, 7°, always lies between 0 and 1. A value of 7° near 0 suggests that the 


regression equation is not very useful for making predictions, whereas a value of r° near | suggests that 
the regression equation is quite useful for making predictions. 


Example E7.2 | 


SSS pe 


For the data of Example E7.1, calculate the 
(a) standard deviation of errors, s_ 
(b) error sum of squares, SSE 
(c) total sum of squares, SST 
(d) regression sum of squares, SSR 
(e) the coefficient of determination, 7’. 
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SOLUTION: 
Referring to Table E7.2, we have 
n = 8, Xx = 360, Ly = 5131, Xx? = 20400, Try = 312830, dy’ = 5104841 


f= — = = 45 
n 8 
Bes keg 
y= => = 641,375 
n 8 
ExV(= 
SS, = Exy—- VP”) _ 319939- ae) 81935 
hn 
Soe = 504092 2 api 
n 
5 NOSy)- 5131)* 
DS Dy 27 = sioga1- 8 ) = 1813945.875 
i 
SS, 81 
b= = at 19.5083 
SS 4200 


a= y —bx = 641.375 —19.5083x 


(a) The standard deviation of errors, s, 


SSyy — BSS. 1813945. tee 1813945.875 —19.5083(81935) ieee 


(b) The error sum of squares, SSE 
SSE = X(y— $)* = 215530.5833 


(c) Total sum of squares, SST 
SST = SS. = 1813945.875 


(d) The regression sum of squares, SSR 
SSR = SST — SSE = 1813946.875 — 215530.5833 = 1598415.2917 


(e) The coefficient of determination, 7 


2 PSS _ (19.5083)81935) _ 9 9915 
SSy, :1813945.875 


7.9 SAMPLING DISTRIBUTION OF 4 3 3 | 


In most cases, the regression line is estimated using sample data. Inferences are then made about the 
population regression line based on the sample regression line. The slope b of a sample regression line is a 
point estimator of the slope B of the population regression line. Different values of b would result in different 


338 // Probability and Statistics for Scientists and Engineers // 


sample regression lines estimations based on different samples taken from the same population. Therefore, 
b is a random variable and it possesses a probability distribution that is more commonly called its sampling 
distribution. 

We describe here the shape of the sampling distribution of b, its mean, and standard deviation. The 
sampling distribution of b is normal based on the assumption of normally distributed random errors. 


The mean and standard deviation of b, denoted by o, and Cr respectively, are 


M,=B 


and op ——— (7.27) 
<p 

In general, the standard deviation of population errors 6, is not known. Therefore, the sample standard 

deviation of errors ©, is used to estimate o.. In this case, when 0. is unknown, the standard deviation of b 


is estimated by o, given by 


er (7.28) 


eG 

If the sample size is large (n = 30) and o, is unknown, then the normal distribution can be used to make 
inferences about B. However, when 6, is unknown and the sample size is small (n < 30), then the normal 
distribution is replaced by the f-distribution to make inferences about B. 


Estimation of B 

The value of b computed from the sample regression line is a point estimate of the slope B of population 
regression line. When ©, is unknown and the sample size is small (7 < 30), then the t-distribution is used to 
make a confidence interval for B. 

Confidence Interval for B 

The (1 — a) 100% confidence interval for B is given by 


b+ts, (7.29) 


where yy = 


(7.30) 


The value of tis obtained from the f-distribution table (Appendix-G) for 0/2 area in the right tail of the 
t-distribution and n — 2 degrees of freedom. 


Example Ew3 4 r 


Construct a 95% confidence interval for B for the data in Example E7.1. 


SOLUTION: 
Refer to the solutions of Examples E7.1 and E7.2. 


n= 8, b= 19.5083, SS. = 4200 and 5, = 189.5304 
The confidence level is 95% 
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s. _ 189.5304 
VSS, 4200 
df=n—2=8-2=6 
0/2 = 0.5 — (0.95/2) = 0.025 


From the f-distribution table (Appendix-G), the value of ¢ for 6 df and 0.025 area in the right tail of the 
t-distribution is 2.447. 


Ls = 


= 2.9245 


The 95% confidence interval for B is 
b+ts, = 19.5083 + (2.447)(2.9245) = 19.5083 + 7.1563 = 12.3520 to 26.6646 


Hence, we are 95% confident that the slope B of the population regression line is between 12.3520 and 
26.6646. 


7.10 HYPOTHESIS TESTING ABOUT B 


To test the hypothesis that x does not determine y linearly, we test the null hypothesis that the slope of the 
regression line is zero, that is, B = 0. The alternative hypothesis is one of the following: 


(a) x determines y, that is, B #0. 
(b) x determines y positively, that is, B > 0. 
(c) x determines y negatively, that is, B <0. 
The value of the test statistic t for b is calculated as 
= D= 6b 


Sp 


G31) 


The value of B is substituted from the null hypothesis. 


Example E7.4 


Test at the 1% significance level if the slope of the regression line for the Example E7.1 is positive. 


SOLUTION: 
Refer to the solutions obtained earlier for Examples E7.1 to 7.3. 
Here n= 8, b= 19.5083 and s, = 2.9245 
The two hypotheses are 

H,; B = 0 (slope is zero) 

H,: B > 0 (slope is positive) 
We can also write the null hypothesis as H,: B < 0 (slope is either zero or negative). 
Since the sample size is small (1 < 30) and © is unknown, we use the f-distribution to make the test about B. 
The significance level is 0.01. The > sign in the alternative hypothesis indicates that the test is right-tailed. 
Hence 
Area in the right tail of the t-distribution , & = 0.01 

df=n-—2=8-2=6 
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From the r-distribution table (Appendix-G), the critical value of ¢ for 6 dfand 0.01 area in the right tail of the 
t-distribution is 3.143, as shown in Fig. E7.4. 


Do not reject H, | Reject Hy 


critical value of t 


Fig. E7.4 


Now 


, b= B _19.5083~0 (from Ho) 
Sy 2.9245 


The value of the test statistic t = 6.6706 is greater than the critical value of t = 3.143, and it falls in the 
rejection region. Therefore, we reject the null hypothesis and conclude that x determines y positively. 


= 6.6706 


7.11 LINEAR CORRELATION 


Linear correlation coefficient is a measure of the relationship between two variables. Linear correlation 
coefficient measures how closely the points in a scatter diagram are spread around the regression line. The 
correlation coefficient calculated for the population is denoted by p and the one calculated for sample data 
is denoted by r. The linear correlation coefficient r measures the strength of the linear relationship between 
the paired x- and y- quantitative values in a sample. The linear correlation coefficient is sometimes referred 
to as the Pearson product moment correlation coefficient in honor of Karl Pearson (1857-1936), who originally 
developed it. Square of the correlation coefficient is equal to the coefficient of determination. The value of 
the correlation coefficient always lies in the range —1 to 1. Hence -1 <p <1 and-1<r<l. 


If r= 1, it refers to a case of perfect positive linear correlation and all points in the scatter diagram lie 
on a straight line that slopes upward from left to right, as shown in Fig. 7.13. If r = —1, the correlation is said 
to be perfect negative linear correlation and all points in the scatter diagram fall on a straight line that 
slopes downward from left to right, as shown in Fig.7. 13(b). 


When there is no linear correlation between the two variables and r is close to 0. Also, in this case, 
all the points are scattered all over the diagram as shown in Fig. 7.13(c). 


x fe) x 
(a) (b) 

Fig. 7.13: Linear correlation between two variables (a) Perfect positive linear correlation, 

r= 1, (b) Perfect negative linear correlation, r = -1, (c) No linear correlation, r = 0 


O 
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Two variables are said to have a strong positive linear correlation when the correlation is positive 
and close to 1. If the correlation between the two variables is positive but close to zero, then the variables 
have a weak positive linear correlation. Similarly, when the correlation between two variables is negative 
and close to —1, then the variables are said to have a Strong negative linear correlation. A weak oumeniie 


linear correlation exists when the correlation between the variables is negative but close to zero. The 
above four cases are shown in Figs. 7.14(a) to (d). 


eee eX 


(a) Strong positive correlation (b) Weak positive linear correlation 
(r close to 1) (ris positive but close to zero) 


(c) Strong negative linear correlation (d) Weak negative linear correlation 
(r close to —1) (r is negative and close to zero) 


Fig. 7.14: Linear correlation between variables 


(a) Perfect positive linear (b) Strong positive linear (c) Weak positive linear 
correlation r = 1 correlation r = 0.92 correlation r = 0.42 
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Y 


vi 
2 
e e, 
Xx x 
(d) Perfect negative (e) Strong negative linear (f) Weak negative linear 
linear correlation r = 1 correlation r = 0.92 correlation r = 0.42 
y 
° 
e Se 


(g) No linear correlation 
(linearly uncorrelated) r = 0 


Fig. 7.14(a): Various degrees of linear correlation 


The simple linear correlation, denoted by r, measures the strength of the linear relationship between 
two variables for a sample and is calculated as 


SS xy 


= (7.32) 
Sins S 


It should be noted here that r and b calculated for the same sample will always have the same sign. 


Properties of the Linear Correlation Coefficient r 


iB 
2 
a 


The values of r is always between —1 and +1 inclusive. That is, -1 <r< 1. 
The values of r does not change if all values of either variable are converted to a different scale. 


The value of r is not affected by the choice of x or y. Interchange all x- and y-values and the value 
of r will not change. 


r measures the strength of a linear relationship. The magnitude of r indicates the strength of the 
linear relationship. A value of r close to —1 or to 1 indicates a strong linear relationship between 
the variables and that the variable x is a good linear predictor of the variable y. That is, the regression 
equation is extremely useful for making predictions. A value of r near 0 indicates at most a weak 
linear relationship between the variables and that the variable x is a poor linear predictor of the 
variable y. That is, the regression equation is either useless or not very useful for making predictions. 
It is not designed to measure the strength of a relationship that is not linear. 


The sign of r suggests the type of linear relationship. A positive value of r suggests that the 
variables are positively correlated, meaning that y tends to increase linearly as x increases, with 
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the tendency being greater the closer that r is to 1. A negative value of r suggests that the variables 


are negatively linearly correlated, meaning that y tends to decrease linearly as x increases, with the 
tendency being greater the closer that r is to -1. 


6. rreflects the slope of the scatter plot. The linear correlation coefficient is positive when the scatter 
plot shows a positive slope and is negative when the scatter plot shows a negative slope. 


7. The sign of r and the sign of the slope of the regression line are identical. If r is positive, so is the 
slope of the regression line. That is, the regression line slopes upward. If r is negative, so is the 
Slope of the regression line and the regression line slopes downward. 


Explained and Unexplained Variation 
The total variation is defined as X(y—y)’, i.e., the sum of the squares of the deviations of the values of \ 


from the mean y. This can be written as 


U(y—y)? = Xy- I) +4(H-y)? (7.32a) 
where y is the value of y for given values of x as estimated from $= a+bx, a measure of the scatter about 
the regression line of y on x. 

The first term on the right side of Eq. (7.32a) is called the unexplained variation while the second term 


is called the explained variation. The deviations y—y have a definite pattern while the deviations 
y — ¥ behave in a random or unpredictable manner. Similar results hold true for the variable v. 


The ratio of the explained variation to the total variation is called the coefficient of determination. If 
there is zero explained variation i.e., the total variation is all unexplained, then this ratio is zero. If there is 
zero unexplained variation i.e., the total variation is all explained, the ratio is one. In all other cases, the ratio 
lies between zero and one. The ratio is always non-negative. 


The quantity, r, is called the coefficient of correlation, and it is given by 


een jexplained sezaiion =. 
total variation 


r varies between —1 and +1. The signs + are used for positive linear correlation and negative error correlation 
respectively. r is a dimensionless quantity. The coefficient of determination equals the square of the linear 


(7.32b) 


correlation coefficient. 


Example E7.5 


Determine the correlation coefficient for the data given in Example E7.1. 


SOLUTION: 
Refer to the solutions obtained earlier for Examples E7.1 and E7.2. We have 
SS_ = 81935, SS = 4200 and SS, = 1813945.875 


ose 81935 


af SSigg5'5 yy . (4200)(1813945.875) 


= 0.9387 


Hence, 
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7.12 ESTIMATING THE MEAN VALUE OF y 


The population regression model is 
y=A+Bx+eE 


and the mean value of y for a given x is denoted by p.,,. Since the mean value of € is assumed zero, the 
mean value of y is given by 


MW =A+ Bx 


The value of { obtained from the sample regression line by substituting the value of x, is the point 


estimate of w,, for that x. All possible samples of the same size taken from the same population will give 
different regression lines as shown in Fig. 7.15 and a different point estimate of p.,,. Hence, a confidence 
interval constructed for 1. based on one sample will give a more reliable estimate of 1. than with a point 
estimate. . 


Population 
regression line 
My =A + Bx 


Regression lines Y= a + bx 
+—————- estimated from different samples 


Fig. 7.15: Population and sample regression lines 
In order to construct a confidence interval for ,, the mean, the standard deviation, and the shape of 
the sampling distribution of its point estimator y should be known. The point estimator 9 of 1, is assumed 
normally distributed with a mean of A + Bx and a standard deviation of . 


1, Go-*)* 
n SS 


xX 


Oy =O (7,33) 


(= 


aA 


where Oy =the standard deviation ) of when it is used to estimate W 


x, =the value of x for which we are estimating 1, 
6. =the population standard deviation of €. 


= 


Here, 6_ is not known and it is estimated by the standard deviation of sample errors s.. Therefore, 6 
4 € 


will be replaced by sand Ss, by in Eq. (7.33). Also, we use the normal distribution for a sample size which 


is large (n = 30) to make a confidence interval for .,. Similarly, t-distribution will be used for a small sample 
size (n < 30). 


The (1 — ot) 100% confidence interval for w.,,. for x = x, is given by 


put Sy (7.34) 
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where the value of r is obtained from the ?-distribution table (Appendix-G) for o/2 area in the right tail of the 


t distribution curve and df =n — 2. The value of Sy is computed from 


(735) 


Example E7.6 
nn 


Refer to Example E7.1 for the velocity and forces. Determine a 99% confidence interval for the mean force 
with a velocity of 5O m/s. 
SOLUTION: 


From the solution of Example E7.1, the point estimate of the mean force for x = 50 m/s is 


¥ =-236.50 + 19.5083x (N) = —236.50 + 19.5083(50) = 738.9167 
The confidence level is 99%. Therefore, the area in each tail of the t-distribution is 
o/2 = 0.5 — (0.99/2) = 0.005 
The degrees of freedom are 
df=n—-2=8-2=6 


From the ¢-distribution table (Appendix-G), the ¢ value for 0.005 area in the right tail of the r-distribution 
and 6 df is 3.707. 


From the solutions of Examples E7.1 and E7.2, we have 
S, = 189.5304, x = 45 and SS_ = 4200 
The standard deviation of as an estimate of [1 for x, = 50 m/s is given by 


=) ae 
S. =5, Ly emia ae = 189.5304 ji, GO=45)" = 68.5860 
i n SS,, 8 4200 


Therefore, the 99% confidence interval for 1, is 
HHS, = 738.9167 + 3.707(68.5860) = 738.9167 + 254.2483 = 484.6684 to 993.1650 


Hence, with 99% confidence we can state that the mean force for all velocities with a velocity of 50 m/s 1s 
between 484.6684 and 993.1650 N. 


7.13 ESTIMATING A PARTICULAR VALUE OF y 


The regression model can also be used to estimate a particular value of v for a given value of x. say x,. 
Here, a confidence interval constructed for y., based on one sample can give a more reliable een ‘, 

i i i i i a prediction 
than with a point estimate. The confidence interval constructed for y, is more commonly called a p 


interval. 
The procedure to construct a prediction interval for y, is similar to that for constructing a confidence 


interval for u... However, the standard deviation of {is larger since we predict a single value of y than we 
wx" , 


estimate [,.. 
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The point estimator ) of a is normally distributed with a mean of A + Bx and a standard deviation of 


(7.36) 


where ©» =the standard deviation of the predicted value of y 


x, =the value of x for which the prediction of y is made 


s_ =the population standard deviation of €. 


= 


In general, o_ is not known. We use the normal distribution in order to make a prediction interval for 7 
when the sample size is large (1 > 30) and r-distribution when the sample size is small (7 < 30). 


The (1 — &) 100% prediction interval for the predicted value of y, denoted by y for x = x, is given by 
tt 9, (7.37) 
where the value of f is obtained from the ¢-distribution table (Appendix-G) for o//2 area in the right tail of the 


t-distribution curve and df =n — 2. The value of 5) is computed as 


(7.38) 


Example E7.7 


Refer to Example E7.1 on velocities and forces. Find a 99% prediction interval for the predicted forces for a 
randomly selected velocity of 50 m/s. 


SOLUTION: 
The regression line was estimated in Example E7.1. The point estimate of the predicted force for x = 50 m/s 
is given by from Example E7.6 
y =—236.50 + 19.5083x (N) = 738.9167 
The area in each tail of the t-distribution for a 99% prediction level is o/2 = 0.5 — 0.99/2 = 0.005. 
The degrees of freedom are 
df=n-—2=8-2=6 


From the t-distribution table (Appendix-G), the ¢ value for 0.005 area in the right tail of the t-distribution 
curve and 6 df is 3.707. From the results of Examples E7.1 and E7.2, we have 


s, = 189.5304, ¥=45 and SS_=4200 


The standard deviation } of as an estimator of ae for x = 45 m/s is computed using 


1 eae 1 (50-45) 
Sy =5,,/1+—+—2—— = (189.5304), 1+ -+ = —_ 0115 
%» n Wns 8 (4200) a 


/! Curve Fitting, Regression and Correlation // 347 


Therefore, the 99% prediction interval for y, for x = 50 m/s is 
tts) = 738.9167 + (3.707)(201.5584) = 738.9167 + 747.1770 =—8.2603 to 1486,094 


Hence, with 99% confidence we can state that the predicted force for a velocity of 50 m/s is between 
—8.2603 and 1486.094. We note here that, this interval is much wider than the one for the mean value of y for 
x = 50 m/s computed from Example E7.6, which was 484.6684 to 993.1650 (N). This always happens to be 
true and in general the prediction interval for predicting a single value of y is always larger than the 
confidence interval for estimating the mean value of y for a certain value of x. 


7.14 LINEARIZATION OF NON-LINEAR RELATIONSHIPS 


Linear regression provides a powerful technique for fitting a best line to data. There exists many situations 
in science and engineering that show the relationship between the quantities that are being considered is 
not linear. There are several examples of non-linear functions used for curve fitting. A few of them were 
described in Table 7.1. 


Non-linear regression techniques are available to fit these equations in Table 7.1 to data directly. A 
simpler alternative is to use analytical manipulations to transform the equations into a linear form. Then 
linear regression can be used to fit the equations to data. 


For instance, y = bx” can be linearized by taking its natural logarithm to give 


fn y=f!nb+ménx (7.39) 


A plot of én y versus /n x will give a straight line with a slope of m and an intercept of /n bas 


shown in Fig. 7.16. 


Iny 


Slope =m 


hn =Inb 


Fig. 7.16: Linearized version of the exponential equation y = bx™ 


In x 
e) 


Many other non-linear equations can be transformed into linear form in a similar way. Table 7.2 lists 


several such equations. 


Table 7.2 
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Linear form Relationship to 
yp=atbx 


(ry) = mbn(x) + £n(c) p=ln(y), X= En(x) 


b=m,a= nc) 
4. | 1 


b=m,a= ln(c) 
y= 


Non-linear 
equation 


Values for least- 
squares regression 


£r(x,) and £n(y,) 


x, and fn(y,) 


x, and fn(y,) 
] 


J=log’y), X=x 
b=m,a=\log(c) 


NTL Xeaten 


log x; and log y; 


ig 
logy=logc +x logd ; x; and log y; 
pacetdlx y=ct+dh pay and t=Vx a[x; and y, 
a=candb=d 


y=ct+dV/x 
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y 


y =a + bflog(x)] 
y=at bx” 


a+bx+ cx, 
rae a+ bx + cx’ 


y=atbx+cx + dx 
log(y) = a + bx + cx +dx 


Fig. 7.16(a): Non-linear data curves 


Example E7.8 


Fit y = cx” (power function) to the data in Example E7.1 using a logarithmic transformation. 
SOLUTION: 


The data can be set up in tabular form as shown in Table E7.8. 


Table E7.8 


ee ee ee (log x Mlog y) 


SS ee ee 


The means are computed as 
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=1.5757 


| 


_ Llogx _ 12.606 
- 8 


i 


= 2.5604 


Il 


_ logy 20.483 
y = 7 
: nt 8 
The slope and the intercept are then calculated using Eqs. (7.13),@.14), (7.15) and (7.16): 
SS. _ n&(log x; (log y;) — (2 log x; (2 log y;) 2 8(33.585) — (12.606)( 20.483) 


: ~ - = 2.0055 
Sy na log x; —(Zlog x;)” 8(20.516) — (12.606)~ 


j= 


a= y —bx = 2.5604 — 2.0055(1.5757) = -0.5997 


The least-square fit is 
log y = -0.5997 + 2.0055 log x 
The fit along with the data is shown in Fig. E7.8. 
Transforming to the original coordinates, we have 
c= 10°) = 0.2514 and m = 2.0055 
Hence the least-squares fit is 
= 0.25145 


7.15 POLYNOMIAL REGRESSION , : — 


The least-squares procedure described in earlier sections can be readily extended to fit the data to a higher 
order polynomial. Consider a second-order polynomial or quadratic: 


y=atbx+crt+e (7.40) 


The sum of the squares of the residuals is 


S, = ¥(; -a—bx; - cx?) (741) 
i=] 

To generate the least-squares fit, we take the first derivative of Eq. (7.41) with respect to each of the 
unknown coefficients of the polynomial. 


Os - 

= re (y, —a—bx, —cx?) 

as n 7 
<= -2) x.(y, —a—bx, — cx?) (7.42) 

db i=l 

Os nN 

5 ‘= -2¥) x7 (y; >a =p, — cx?) 
P 


i=] 


Equations (7.42) are set equal to zero and rearranged to obtain the following set of normal equations: 
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Ble [Eelnte ww 
Belo(Be)o Belge 


Equation (7.43) are all linear equations in three unknowns: a, b and c. These coefficients a. b and c can 
be determined directly from the observed data. 


The above procedure can be easily extended to an m" order polynomial as in 
y=athx+crt+de+-+z"+e (7.44) 


Hence, the determination of the coefficients of an m' order polynomial is equivalent to solving a system 
of (u. + 1) simultaneous linear equations. 


The standard error in this case is given by 


S, 


S — EEE 
it |) 


(7.45) 


ylx ad 
The coefficient of determination, 7°, can be computed for a polynomial regression with Equation (7.25). 


Example E7.9 


Fit a second-order polynomial to the data in Table E7.9 and determine the total standard deviation, the 
standard error of the estimate and the correlation coefficient. 


Table E7.9 


se | | 1 2a se 
MEMES BAe iis 


SOLUTION: 


Tables E7.9(a) and (b) shows the computations for an error analysis of the quadratic least-squares fit. 


Table E7.9(a) Table E7.9(b) 


Hence, the simultaneous linear equations are 
6 15 SSil4@ 153 
15 55 225|{b+=4 586 
S59y225 | Dil le 2488 
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Refer to Appendix-/ (Cramer’s rule for solving a system of linear algebraic equations). 


BIS 55 
Here Dans 55.-225|= 3920: 
55-225 «979 
l3ee8 aes 
D, =| 586 55 225)=9800; 
2488 225 979 
6 153 55| 
| 
D,=|15 586 225/=9884; 
55 2488 979 
6 (15 i153 
D,=|15 55 586|=7140 
55 225 2488 
Therefore, 
ee A 
D 3920 
b= 22 = 284 9 5944 
D = 3920 
D 
eee end 
D 3920 


These equations can also be solved to determine the coefficients using MATLAB: 
SSA = [6 15:55; 15 55.2252535. 225 979), 
>>b = [153; 586; 2488); 
x=Alb 
x=2.5 2.5214 1.8214 
or a= 2.5, b=2.5214, and c = 1.8214 
Hence, the least-squares quadratic equation for this problem is 
y=25 + 25214x + 1.8214 


The standard error of the estimate is based on the regression polynomial given by Eq. (7.48), where 


S, =X(y-5)°. Here, we have 


S 3.229 
Syi, =,/———— =, |-—_ = 1.0374 
! Were cay {6-@+) 
The coefficient of determination is given by Eg. (7.25) 


2 S125, _ 24935-3229 


= 0.998 
y 2493.5 
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where S,=E(y,- vy 
ce S, = 29 - $Y 


Therefore, the correlation coefficient r = J0.9987 = 0.99935. These results show that 99.935% of the original 
uncertainty has been explained by the model. 


7.16 QUANTIFICATION OF ERROR OF LINEAR REGRESSION 


Noting that the sum of the squares is defined as 


S,= YO; —a-bx,; -— cx; i (7.46) 


Equation (7.46) is similar to the equation 


" 
SG.) (7.47) 

ei 
In Eq. (7.47), the squares of the residual represented the squares of the discrepancy between the data 
and a single estimate of the measure of central tendency (the mean). The squares of the residual represent 
the squares of the vertical distance between the data and another measure of central tendency (the straight 
line). If the spread of the points around the line is of similar magnitude along the entire range of data and 
the distribution of these points about the line is normal, then the least-squares regression will provide the 
best estimates of a and b. This is known as the maximum likelihood principle. Also, if these criteria are 

met, a standard deviation for the regression line can be determined as 


A es (7.48) 


where S), _is called the standard error of the estimate. 


The —. between the S and S. quantifies the improvement or error reduction due to describing 
the data in terms of a straight line rather than as an average value. The difference is therefore normalized to 


Sto give 


os (7.49) 


where r? is called the coefficient of determination and r is the correlation coefficient. For a perfect fit, 
S = 0, and r° = 1, indicating that the line explains 100% of the variability of the data. For 7 = 0, S, = 5, and 


the fit represents no improvement. An alternative formulation for ris given by 


Zeon Es) By] 
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Ee ( [Ex {3) 


7—) 


Example E7.10 


Determine (a) the total standard deviation, (b) the standard error of the estimate and (c) the correlation 
coefficient for the data in Example E7.1. 


SOLUTION: 
Table E7.10 shows the data and summation to compute the goodness-of-fit statistics. From Example E7.1, 
we have a = —236.50, b = 19.5083, x =45 and y = 641.375. 


Table E7.10 


—41.4167 
153.6667 
348.7500 
543.8333 
738.9167 


934.0000 
1129.0833 
1324.6667 


where S is the total sum of the squares of the residuals between the data points and the mean. 


1813945. 
5, = | EEBESRES are 1746 


Hence 


The standard error of the estimate is 


38115 1.8906 
328758.8906 
69366.3906 
7987.8906 
11 13.8906 
332496.3906 
35957.6406 
657 112.8906 


S, 215530.583 
So ee | = 164.1320 
n-2 8-2 


Since S| < S\, the linear regression model has merit. 


The coefficient of determination 7° is given by Equation (7.49) 


2 S; mel 
S; 


r 


— 1815945.875—215930:583 
1813945.875 


4279.3403 
7338.7778 
355/502) 
66.6944 
17139. 0736 
80656.0000 
88853.6736 
16341.3611 


= 0.8812 
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or r=~0.8812 = 0.9387 


These results indicate that 93.87% of the original uncertainty has been explained by the linear model. 


7.17 MULTIPLE LINEAR REGRESSION — ; 


Consider a function y which is a linear function of x, and x, as in 
y=at+bx,+cx,+e Gish) 


Equation (7.51) is quite useful in fitting experimental data where variable being studied is often a function 
of two other variables. For this two-dimensional case, the regression line becomes a plane. The best values 
of the coefficients are obtained by formulating the sum of the squares of the residuals: 


S,= » (y; —a—bx,; —cx,;)° (7.52) 


i=] 


Differentiating equation (7.52) with respect to each of the unknown coefficients, we get 


eh) : 

i) Vege, — OO, 

da 2 ol 
as n 

— x; 6 t= Dis — CO, ) 
ab > aed cas 
as, 


n 
5) = 2), Xp (yj — a— bx j — CX ;) 
e i=l 


The coefficient giving the minimum sum of the squares of the residuals are obtained by setting the 
partial derivatives equal to zero and expressing the result in matrix form as 


> a Mi Sees, b;= ey (7.53) 


Example E7.11 


The following data was generated from the equation y = 7 + 3x, + 4x,. Use multiple linear regressions to fit 
this data. 
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SOLUTION: 
Table E7.11(a) 


i J 


: ; , 
4 2 1 
l 5 o 
e 20 
i 
6 
0 


The summations required for Eq. (7.53) are computed in Table E7.11(a) as shown above. Substituting these 
values in Equation (7.53), we get 


BWR MWe NY © 


OnNnNNA fe N— OO 


8.17 19a 183 
17 75 40|45b;+=4504 
19° 40 soar) le mi) 


which can be solved using Cramer’s rule or MATLAB. 


Refer to Appendix-/ (Cramer's rule for solving a system of linear algebraic equations). 


Sal? sole 
D=|17 75 40)}= 6180; 
[9 “40.65 


183 17 19 
=|504 75 40|= 43260; 
513 40 65 


|8 183 19 
D,=|17 504 40) =18540; 
19 513 65 


8 17 183 
17 75 504) = 24720 
19 40 513 


ae 
I 


D, _ 43260 _ 


DD 6180 — 
D, _ 18540 _ 
D~ 6180 — 
_ Ds _ 24720 _ 


D 6180 
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MATLAB Program: 


>>A = [8 17 19; 17 75 40; 19 40 65}; 
>>B = [183; 504; 513]; 


x=A/B 
se=7er oy 3 anel = ead! 
or a=". p= 3). and c=4. 


which is consistent with the original equation from which the data was derived. 


7.18 WEIGHTED LEAST-SQUARES METHOD 


Referring to the sections 7.3 and 7.4 and assigning weights w, to each error, e. (i= 1, 2, .... 2) in Equation 
(7.4) such that Zw, = 1. 


Equation (7.5) can be written as 


S.= os w; Ly; —(a+ bx; I (7.54) 


i=! 


For Sto be a minimum, we have 


aS, as, 


= 0 and =) (7.55) 
da db 
We obtain the normal equations as 
a+ bLw,x; = Zw, y; (7.56) 
aLw,x, + biw,x, = Zw;x,y; (7.57) 


The solution of Equations (7.56) and (7.57) gives the values for a and b. These values give the minimum 
of S_ with respect to the weight w.. 


Similarly, for a parabolic equation, we need to minimize 
S, = Zwjly, —(at bx, + 0x7)? (7.58) 
where 2w.= 1. 
For S_ to be minimum, we have 


OS eign 2 0 (7.59) 


da ob dc 


On simplification, we obtain the following normal equations to determine a, b and c. 


a+ b2w;x; + clw;,x? = LW); (7.60) 
aLw;x; + bIw,x? a clw;x? = Zw,%; yj; (7.61) 
(7.62) 


a&w,x; + biw;x; + clw;x; = lw; x,y; 
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7.19 ORTHOGONAL POLYNOMIALS AND LEAST-SQUARES 


APPROXIMATION 


The previous sections considered the problem of least squares approximation to fit a collections of data. 


This method is also applicable for continuous data. 


7.20 LEAST-SQUARES METHOD FOR CONTINUOUS DATA 


Let v = f(x) be a continuous function on [a, b] and it is to be approximated by the n' degree polynomial. 
(7.63) 


ySa,taxtaxrtortax" 
Here the sum of the squares of residuals S is given by 
b 2 te 
S= I w(x) Ly —(apx + aox° ++a,x" IV dx 
where w(x) is a suitable weight function. 
The necessary conditions for minimum S are given by 


dd, 0a, 0a, 


Equation (7.65) give the normal equations as 


0 


b 9 
-2{ w(x)Ly — (ap + xt ax° +---+a,x" dx =0 
a a 
b 2 n 
-2{ w(x)Ly — (dp +ay,xX+ ax +-"+a,x )jxdx=0 
a 


b 
-2] w(x)Ly — (dy + xt ae tert a,x")] x’ dx=0 


b g) 
-2f Wx)Ly — (dp + ayxt+ ax” +--+ a,x") x" dx=0 
After simplification these equations reduce to 


b b b b 
ao w(x) dx+a, Le xw(x) dx +--+ a, [ x" w(x) dx =| w(x) y dx 


b b 4 b b 
dp I xw(x) dx+ a, J. Wk dat tae f. x" W(x) dx =| w(x)xy dx 


b b b 
ay x’ w(x) dx+ a, lL x W(x)axte--+a, I. x"*2 w(x) dx =|” w(x)x? ydx 


b b b b 
Ay J. x" w(x) dx+ a, I, x W(xdrte-ta, I. x" w(x) dx =| w(x)x" y dx 


(7.64) 


(7.65) 


(7.66) 


(7.67) 


Since w(x) and y = f(x) are known, equation (7.67) form a system of linear equations with (2 + 1) 


unknowns a,, d,, ...., a,. This system of equations possesses a unique solution. If 


4 * = cd * 
a, = Ap» a, _ a, . seey A = a, 
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is the solution for a,, a,, ...., @, then the approximate polynomial is given by 
y=a,t+a,xt mee Et a,x" 
Example E7.12 


Construct a least-squares quadrate approximation to the function TCO =singemon (0, 1]. 


SOLUTION: 


The normal equations for P,(x) = a,x? + a * +a, are 


1 1 i iy 

ay]. ldx+a, J, xdx + a, i xdx= J, sin 1 x dx 1 
1 ae l 3 i 5 

qy[ xdx+a,] x dx+ay) x dx=|_ xsinnxdx (E2) 
ee: L 35; a Me es (E3) 

ay| x dx+a,] x a+ an| x dx= |x? sin Tt x dx is 


Performing the integration gives 


3 a a ue = z (E4) 
A oe aa : 
ay + a ar - : (E35) 
= — a5 = lew 
Sir” aie 
= a, oe te _==4 (E.6) 
S 0 4 1 5 2 ne " 
Equations (E.4), (E.5) and (E.6)in three unknowns can be solved to obtain 
2 — 
pee ste 201050465 
- 
0-60n* 
and eae eo a0e = 4.1225] 
T 


Consequently, the least squares polynomial approximation of degree 2 for f(x) = sin px on [0, 1] Is 
P(x) = 4.1225 1x? + 4.12251x — 0.050465. 


7.21 APPROXIMATION USING ORTHOGONAL POLYNOMIALS 


In section 7.19, a function is approximated as a polynomial containing the terms 1, x, x°, ...., x”. These terms 
are called base functions, since, any function or even discrete data are approximated based on these 


functions. 
Here, we assume that the base functions are some orthogonal polynomials f(x), f,(x), .-... f(x). Let the 


given function be approximated as 
Y= ah) +a f+ +4, ,0) (7.68) 


where f(x) is a polynomial in x of degree 1. Then the residue is given by 
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5 =f WL ap folx) +a, fd tot a, fy de (7.69) 
For minimum S, the conditions are given by 


OS 9 8 a9 = (7.70) 


Equation (7.70) yield the following normal equations: 


-2” wx Ly ~ {ap fo(2) + a fi ++ ay fy D1 fox) de = 0 


-2f" W(X) LY — {Ay fo (X) + a A(X) toot a, F(X) IF (x) dx = 0 


-2)" w(x)Ly — {ap fo(x) +a, f,(x) +++ 4, f, OOF, &) dx = 0 (7.71) 


After simplification, the i equation can be written as 


ayf wx) fold fadde+ af” wx) ff Of @dde+~ 


+4; I w(x) fy (x) dic +++ a,{ w(x) f(x) f; (x) dx = f. w(x)y f(x) dx (7.72) 
1 =O see 


A set of polynomial { f,(x), f\), ...-,f,Q)} is said to be orthogonal with respect to the weight function 
w(x) if 


; 0, if it j 
(x) fj (x)w(x) dx = 5 pa (7.73) 
J, Oh; [, Pwd, if i= j 
Using Equation (7.73), Equation (7.72) can be written as 
b b 
a,) wf? (dx =f wo fia) de, i=0,1,2,...47 
b 
J wy fied de 
Hence, a; = 75, _ ee P=O 2a (7.74) 
J, woof? 0) ae 
From Equation (7.74), we can find the values of a,, ,, ...., a, and the least squares approximation is 


obtained by substituting these values in Equation (7.68). However, the functions TOSCO... f(x) are 
unknown. Several orthogonal functions are available in literature. A few of them are given in Table 7.7. 


Any one of the orthogonal functions can be selected to fit a function dependent on the given problem. 
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Table 7.7: Some standard orthogonal polynomials 


Legendre 
Leguerre 
Hermite 
Chebyshev 


7.22 GRAM-SCHMIDT ORTHOGONALIZATION PROCESS 


Let f(x) be a polynomial in x of degree i and { f,(x)} be a given sequence of polynomials. Then the sequence 


of orthogonal polynomials over {fr (x)} the interval [a, b] with respect to the weight function w(x) can be 


generated by the following equation 
es isl ; 
f, @=x -)' a;,f, (G2 eer —a eee eee (7.75) 
r=0 


where the constants are a,, and a (Ge) = il. 


To obtain a,, we multiply Equation (7.75) with w(x) f,, (x),0<k <i- 1 and integrating over [a, b], we 


obtain 
i-l 
* * b i p* b * m ? 
ie (x) f, (w(x) dx =| xf, (x)w(x) dx—{ wee (x) fi, (wr) dv (7.76) 
a a a Aa 
Using the property of orthogonal polynomial, Equation (7.76) becomes 


[XK weaddr-J a, f C)w(de=0 


or , BRACE: easels O<n<i-1 (7.77) 


f? £209 we) de 


Hence, the set of orthogonal polynomials { f; (®)} are given by 


fe! 
i-l 
f' (o=x%-Ya,f en 
reid) 
[x Reward = 
where a: le 2 (x) w(x)dx 


For the discrete data, the integral is replaced by summation. 
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Example E7.13 
Use Gram-Schmidt orthogonalization process to find the first two orthogonal polynomials on [—1, 1] with 
respect to the weight function w(x) = 1. 


SOLUTION: 
Let fio! 
Hence es) =xX- ayy fo (x) 
1 
i ie GR 
where ao = - =0 
I dx 
= 
or Ale = 


The second orthogonal polynomial is given by 


fy (x) = x" - ayy fy (0) - anf, 


[Pax 1 
where Any = i =. 
: a 8 
dx 
= 
ie 
I Sogeany 
i ——=0 
| x dx 
=| 
Hence Pgh mel ae —1) 
: geteae = 


Thus, the first two orthogonal polynomials are 


fe i * 1 
fii@al f @ =x and 2 G@)— 30" —1) 


7.23 ADDITIONAL EXAMPLE PROBLEMS AND SOLUTIONS 


Example E7.14 


Determine the equation to the best fitting exponential curve of the form y = ae” for the data given in Table 
E7.14. 


Table E7.14 


Li] 3] 5] 7] 91 
ry Las [ios [95 [as [80 
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SOLUTION: 
Refer to Table E7.14(a). 


Table E7.14(a) 


HEEE 


l 
3 
oP OS 
y ie 
- 


Given y=ae" 
The normal equations are 
Llogy,=5A+Bxrx, (E.1) 
Lx, logy,=AUx,+ BX x? (E2) 
where A=log a and B=b loge. 


Solving the two normal Equations (E.1) and (E.2), we get 
A = 2.0802 and B =-0.0203 
Hence a= antilog of A = 107 = 120.2818 


—0.0203 


and b= B/loge= = —0.0075. 


Hence, the fitted equation is y = 120.2818 e°®* 


Example E7.15 


For the data given in Table E7.14, find the equation to best fitting curve of the form y = ab". 
Table E7.15 


SOLUTION: 


The calculations are shown in Table E7.15(a). 
Table E7.15(a) 


129.2062 
149.5433 
173.0814 
200.3144 
231.8555 


884.0108 


E 
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The normal equations are obtained as follows: 
y=ab 
Taking logarithms (base 10) on both sides of the above Equation (E.1), we get 
log y= log a+ x log b 


or Y=A+ Bx 
where Y= log y, A= log a and B= log b. 
Hence, the normal equations are 
LY=nA + BXx 
UxY = Adx + Bir 


Substituting the values from Table E7.13(a) into Equations (E.4) and (E.5), we have 


11.1920 =5A + 15B 
34.2113 = 154 +55B 
Solving Equations (E.6) and (E.7), we obtain 
A= 2.0478 and B= 0.0635 
Hence a= antilog of A = 10°” = 111.6349 
b=antilog of B = 10° = 1.1574 


(E.1) 
(E2) 
(E3) 
(E4) 


(E5) 


(E6) 
(E7) 


Hence the fitted equation is y = 111.6349(1.1574)*. The estimated values of y (denoted by y ) are shown in 


the last column of Table E7.13(a). 


Example E7.16 


For the data given in Table E7.15, find the equation to best fitting curve of the form x“ = b. 


Table E7.16 


[iso TrooT coy oT 10 


SOLUTION: 
See Table E7.16(a). 


Given w=b 


(E.1) 
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Taking logarithms (to the base 10) on both sides of the above Equation (E.1), we get 
log x +a log y=log b (E.2) 


| 
or he be eee 
a a 


(ES) 


The normal equations are given by 


XY=6A + BIX (E4) 
2XY = AXX + BXX? (E5) 
where Y=logy, X=logx,A= z log b and B = -I/a. 
a 


Solving Equations (E.4) and (E.5), we obtain 
A = 1.4865 and B = -0.6154 
Therefore _a=—1/B =~1/-0.6154 = 1.6250 
and b = antilog of (aA) = 1061-6X1-486) — 260.3529 
Hence, the fitted equation is xy! = 260.3529. 


Example E7.17 

Fit the following data: 
BRIS 
fy [11] 16 | 19 | 26] 


to a straight line by considering that the data (2, 16) and (4, 19) are more significant or reliable with weights 
6 and 11 respectively. 

SOLUTION: 

Weighted Least Squares Method 

Let the straight line be y = a + bx. The normal equations are 


axw, + biwx, = Lwy, (E.1) 
and aLwx, + biw= Xwx,y, (E.2) 
The values in Equations (E.1) and (E.2) are calculated as shown in Table E7.17. 
Table £7.17 


The normal equations are 
19a + 62b = 342 (E.3) 
and 62a + 236b = 1184 (E.4) 
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The solution of Equations (E.3) and (E.4) gives 
a= 11.4125 and b = 2.0188 
Hence, y= 11.4125 + 2.0138x 


Estimation of error 


(Absolute error) 


5) 0 
2 | 16 | 6 
4/19} 1l 
64). 26ulel 


Sum of squares of errors 


Example E7.18 


Consider the Example E7.17 with the modified weights 300 and 50 instead of 6 and 11. 


SOLUTION: 


The modified calculations are shown in Table E7.18 


Table E7.18 
[x {y | w | wx | wx" | wy | wry | 


OTe 0 0 11 0 
2} 16|30| 60) 120| 480) 960 
4 | 19 | 50 | 200 | 800} 950 | 3800 
Oe 2642 21 6| 36 26 | 156 


The normal equations are 
82a + 266b = 1467 (E.1) 
and 266a + 956b = 4916 (E.2) 
The solution of Equations (E.1) and (E.2) gives 
a= 12.4144 and b= 1.6881 
Hence, y= 12.4144 + 1.6881x 


Estimation of error 


x |y_| w | Predicted y | Absolute error | (Absolute error) 


Lea el 
16 | 30 
197450 
Ga267} 1 


Sum of squares of errors 


It is noted that when the weights on x = 2 and x = 4 are increased then the absolute error in y are reduced at 


these points, but, the sum of squares of errors is increased due to the less importance of the data (0, 11) 
and (6, 26). 
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7.24 SUMMARY : | 


In this chapter, we have reviewed the relationship between two variables in two ways: (1) by using the 
regression analysis and (2) by computing the correlation coefficient. It was shown that the regression model 
can be used to evaluate the magnitude of change in one variable due to a certain change in another variable. 
The regression model also helps to predict the value of one variable for a given value of another variable. 
The correlation coefficient shows how strongly two variables are related. It does not, however, provide any 
information about the size of change in one variable as a result of a certain change in the other variable. 


We have also reviewed the inferential methods in regression and correlation. 


PROBLEMS 


P7.1 Table P7.1 gives information on the monthly incomes (in hundreds of dollars) and monthly telephone 
bills (in dollars) for a random sample of 10 households. 


Table P7.1 


ee 7" 45) 350i | 30s) 14 | 40) 15 [sero 

| Telephone bill | 140 | 171 | 70 | 94 | 25 | 159 | 41 | 78 | 98 
Use least-squares regression to determine the coefficients a and b in the function y = a + bx that 
best fits the data. 


[Pap The following table P7.2 lists the annual incomes (in thousands of dollars) and amounts of life 
insurance (in thousands of dollars) of life insurance policies for six persons: 


Table P7.2 


150 | 500 | 75 

(a) find the regression line y = a + bx with annual income as an independent variable and amount 
of life insurance policy as a dependent variable. 

(b) determine the estimated value of life insurance of a person with an annual income of $50,000. 


7.3 Find the least-squares regression line for the data on annual incomes and food expenditures of 
seven households given in Table P7.3. Use income as an independent variable and food expenditure 
as a dependent variable. Ail data is given in thousands of dollars. 2 


Table P7.3 


Income: x 35 | 50 | 22 | 40 | 16 | 30 | 4 
Expenditure: 


P7.4 Table P7.4 gives data on age and crown-rump length for the fetuses. Use least-squares regression 
to determine the coefficients a and b in the function v = a + bx that best fits the data: 


Table P7.4 


ig | 19 pgs" | _28 
227 | 234 | 279 
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PTS 


7.6 


P7.7 


Grade on first mid-term examination (x) 


Grade on second mid- 


The following data in Table P7.5 refers to the number of hours that 10 students studied for a math 
test and their scores on the test: 


Table P7.5 


Hours studied |_| pr faa fiat Tf ata od 
[Test score | 21 | 83 | 90 | 60 | 45 | 38 | 74 | 66 | 59 | 32 | 


(a) find the equation of the least-squares line that approximates the regression of the test scores 
on the number of hours studied. 
(b) determine the average test score of a person who studied 15 hours for the test. 


The following Table P7.6 shows the first two grades, denoted by x and y respectively, of 10 students 
on two mid-term examinations in applied statistics. 
(a) find the least squares regression line of y on x. 


Table P7.6 


| 60 | 50 | 80 | 80 | 70 | 60 | 100 | 40 | 90 | 70 | 
term examination (y) [ 80 | 70 | 70 | 90 | 50 | 80 | 95 _| 60 | 80 | 60 | 


Age (x) 


The following Table P7.7 shows ages x and systolic blood pressure y of 12 men. 
(a) determine the least squares regression equation of y on x 


(b) estimate the blood pressure of a man whose age is 45 years. 


Table P7.7 


P7.8 


Pp 


P7.10 


Blood pressure (y) 


Table P7.8 shows the respective weight x and y of a sample of 12 fathers and their oldest sons. 
(a) find the least-squares regression line of y on x. 


Table P7.8 


Weight of father, x (kg) | 65 | 63 | 67 | 64 | 68 | 62 | 70 | 66 | 68 | 67 | 69 | 71 | 
| Weight of son, y (kg) | 68 | 66 | 68 | 65 | 69 | 66 | 67 | 65 | 70 | 67 | 68 | 70 | 


Find the least squares regression line for the data on annual incomes and food-expenditures of 
seven households given in Table P7.9. Use income as independent variable and food expenditure 
as a dependent variable. The income and food-expenditures are in thousands of rupees. 


Table P7.9 


| Food expenditure y | 9 [15] 7[10{ 5] 8[ 85] 


A car manufacturing company wanted to investigate how the price of one of its car models 
depreciates with age. The company took a sample of eight cars of this model and collected the 
following information on the ages (in years) and prices (in hundreds of dollars) of these cars as 
shown in Table P7.10. 
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(a) 


(b) 
(c) 
(d) 


Table P7.10 


Paes alo! 2s) 6! 3 
Price | 16 | 74 | 40 | 19 | 120 | 36 | 33 | 86 


find the regression line } = a+bx with price as a dependent variable and age as independent 
variable 


give a brief interpretation of the values of a and b calculated in part (a) 
predict the price of a 7-year old car of this model 
estimate the price of an 4-year old car of this model. 


For problems P7.11 to P7.20 do the following: 


Fit a least squares regression line of the form $=a+bx for the data given in Tables P7.1 to P7.10 


respectively 


P7.11 
P7.12 
P73 
P7.14 
F715 
P7.16 
P7.17 
P7.18 
719 
P7.20 
P7.21 
P7.22 
P7.23 
P7.24 
P7.25 
P7.26 


(a) 
(b) 
(c) 
(d) 
(e) 
(f) 
(g) 


. Assume x as the independent variable and y as the dependent variable. 


give a brief interpretation of the values of a and b calculated in } = a+bx. 


compute the standard deviation of the sample errors, s .. 
compute the error sum of squares, SSE. 

compute the total sum of squares, SST. 

compute the regression sum of squares, SSR. 

compute the coefficient of determination, 7. 

compute the correlation coefficient, r. 


For the data given in Table P7.1. 
For the data given in Table P7.2. 


For the data given in Table P7.3. 


For the data given in Table P7.4. 


For the data given in Table P7.5. 


For the data given in Table P7.6. 


For the data given in Table P7.7. 


For the data given in Table P7.8. 


For the data given in Table P7.9. 


For the data given in Table P7.10. 


Construct a 95% confidence interval for B for the data in Problem P7.1. 


Construct a 95% confidence interval for B for the data in Problem P7.2. 


Construct a 95% confidence interval for B for the data in Problem P7.3. 


Construct a 95% confidence interval for B for the data in Problem P7.4. 


Construct a 95% confidence interval for B for the data in Problem P7.5. 


Construct a 95% confidence interval for B for the data in Problem P7.6. 
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P7727 
P7.28 
P7.29 
P7.30 
P7351 
Pi.32 
P7.33 
P7.34 
P7355 
P7.36 
P7357 
P7.38 
P7592 
P7.40 
P7.41 


P7.42 


P7.43 


P7.44 


P7.45 


P7.46 


P7.47 


Construct a 95% confidence interval for B for the data in Problem P7.7. 

Construct a 95% confidence interval for B for the data in Problem P7.8. 

Construct a 95% confidence interval for B for the data in Problem P7.9. 

Construct a 95% confidence interval for B for the data in Problem P7.10. 

Test at the 1% significance level if the slope of the regression line in Problem P7.1 is positive. 
Test at the 1% significance level if the slope of the regression line in Problem P7.2 is positive. 
Test at the 1% significance level if the slope of the regression line in Problem P7.3 is positive. 
Test at the 1% significance level if the slope of the regression line in Problem P7.4 is positive. 
Test at the 1% significance level if the slope of the regression line in Problem P7.5 is positive. 
Test at the 1% significance level if the slope of the regression line in Problem P7.6 is positive. 
Test at the 1% significance level if the slope of the regression line in Problem P7.7 is positive. 
Test at the 1% significance level if the slope of the regression line in Problem P7.8 is positive. 
Test at the 1% significance level if the slope of the regression line in Problem P7.9 is positive. 
Test at the 1% significance level if the slope of the regression line in Problem P7.10 is positive. 


Refer to Problem P7.1 

(a) construct a 90% confidence level for B 

(b) test at the 5% significance level if B is negative. 
Refer to Problem P7.2 

(a) construct a 90% confidence level for B 

(b) test at the 5% significance level if B is negative. 
Refer to Problem P7.3 

(a) construct a 90% confidence level for B 

(b) test at the 5% significance level if B is negative. 
Refer to Problem P7.4 

(a) construct a 90% confidence level for B 

(b) test at the 5% significance level if B is negative. 
Refer to Problem P7.5 

(a) construct a 90% confidence level for B 

(b) test at the 5% significance level if B is negative. 
Refer to Problem P7.6 

(a) construct a 90% confidence level for B 

(b) test at the 5% significance level if B is negative. 
Refer to Problem P7.7 

(a) construct a 90% confidence level for B 

(b) test at the 5% significance level if B is negative. 


P7.48 


P7.49 


P7.50 


P7.51 


P7.52 


P7s3 


P7.54 


P7355 


P7.56 


| ey 


P7585 


P7539 


P7.60 


P7.61 
P7.62 
P7.63 
P7.64 
P7.65 
P7.66 


// Curve Fitting, Regression and Correlation // 371 
Refer to Problem P7.8 


(a) construct a 90% confidence level for B 

(b) test at the 5% significance level if B is negative. 
Refer to Problem P7.9 

(a) construct a 90% confidence level for B 

(b) test at the 5% significance level if B is negative. 
Refer to Problem P7.10 

(a) construct a 90% confidence level for B 


(b) test at the 5% significance level if B is negative. 


Refer to Problem P7.1. Find a 99% confidence interval for the mean value of telephone bill. y. when 
the income, x = $4200. 


Refer to Problem P7.2. Find a 99% confidence interval for the mean value of life insurance. y, when 
the annual income, x = $53000. 


Refer to Problem P7.3. Find a 99% confidence interval for the mean value of expenditure, y. when 
the income, x = $42000. 


Refer to Problem P7.4. Find a 99% confidence interval for the mean value of y, when the value of 
x= 20. 


Refer to Problem P7.5. Find a 99% confidence interval for the mean value of test score, y. when the 
hours studied, x = 15. 


Refer to Problem P7.6. Find a 99% confidence interval for the mean value of grade on second mid- 
term examination, y, when the grade on first mid-term examination, x = 82. 


Refer to Problem P7.7. Find a 99% confidence interval for the mean value of blood pressure, \. 
when the age, x = 65. 


Refer to Problem P7.8. Find a 99% confidence interval for the mean value of weight of son, y, when 
the weight of father, x = 72. 


Refer to Problem P7.9. Find a 99% confidence interval for the food expenditure, vy, when the income. 
x = Rs.41,000. 


Refer to Problem P7.10. Find a 99% confidence interval for the price, y, when the age of the car, 
x =7 years. 

Fit y = bx” (power function) in Problem P7.1 using a logarithmic transformation. 

Fit y = bx" (power function) to the data in Problem P7.2 using a logarithmic transformation. 

Fit y = bx” (power function) to the data in Problem P7.3 using a logarithmic transformation. 

Fit y = bx” (power function) to the data in Problem P7.4 using a logarithmic transformation. 

Fit y = bx” (power function) to the data in Problem P7.5 using a logarithmic transformation. 


Determine the coefficient of the polynomial y = a + bx + cx’ that best fit the data given in the 
following table. 
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ats Ts 7 
ry Pat st [sas Pora 


Determine the standard error of the estimate and correlation coefficient. 


P7.67 The following data were collected in an experiment to study the relationship between shear strength 
in kPa (y) and curing temperature in °C (x). 


px {138} 140 1.46 
ry [5.392 [5.612 [5.671 [5.149 | 448i 


(a) fit a least-squares quadratic model of the form y = a + bx + cx’ to the above data 


a 
= 
N 
Oo 


(b) using the equation, compute the residuals. 

(c) compute the error sum of squares and total sum of squares. 
(d) compute the error variance estimate. 

(e) compute the coefficient of determination. 


P7.68 The following data were collected in an experiment to study the relationship between the number 
of kilograms of fertilizer (x) and the yield of tomatoes in bushels (y). 


(a) fit a least-squares quadratic model of the form y = a + bx + cx’ to the above data. 


nt 
(b) using this equation, compute the regression sum of squares i 3,-y). 
i=l 


n n 

(c) compute the error sum of squares » (y= $,)° and total sum of squares > (y; - yy : 
i=l i=l 

(d) compute the error variance estimate (b) + (c). 


(e) compute the coefficient of determination, r’. 


P7.69 Fit a least-square parabola y = a + bx + c’ to the following data: 


40112 43-44 [5 _[s 
fy | 24] 21 | 3.2] 5.6] 93 [ 146 [219 | 


Determine the coefficient of determination. = 


P7.70 The following table gives the data collected in an experiment to study the relationship between 


the stopping distance d(m)_ of an automobile traveling at speeds v(km/hr) at the instant the danger 
is sighted. 


(a) fit a least-squares parabola of the form d= a + by + cv* to the data 
(b) determine the coefficient of determination. 


Speed v(km/hr) 32 | 48 64 80 | 96 Liz 


Stopping distance d(m) | 16.5 253 


E573 


P7.74 


P7.75 


F746 


P7.77 


P7.78 


P7.79 


P7.80 


P7.81 


P7.82 


P7265 
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For the data given in Problem P7.1, find a 99% prediction interval for y for a randomly select. 
value of x = 42. 


For the data given in Problem P7.2, find a 99% prediction interval for y for a randomly’ select 
Walle Of v= 53: 


For the data given in Problem P7.3, find a 99% prediction interval for y for a randomly select. 
value of x = 42. 


For the data given in Problem P7.4, find a 99% prediction interval for y for a randomly seleete. 
value of x = 20. 


For the data given in Problem P7.5, find a 99% prediction interval for \ for a randomly selecte: 
value of x = 15. 


For the data given in Problem P7.6, find a 99% prediction interval for y for a randomly selected 
value of x = 82. 


For the data given in Problem P7.7, find a 99% prediction interval for y for a randomly selectud 
value of x = 45. 


For the data given in Problem P7.8, find a 99% prediction interval for y for a randomly select «! 
value of x = 72. 


For the data given in Problem P7.9, find a 99% prediction interval for y for a randomly selected 
value of x = 41. 


For the data given in Problem P7.10, find a 99% prediction interval for y for a randomly selector 
value of x = 7. 


Use multiple linear regression fit of the form y = a + bx, + cx, for the following data: 


(Ee ee ee 
ra) 
peas bir hee baer ooa [as bar base ba 


Compute the coefficients, the standard error of the estimate, and the correlation coefficient. 


Use multiple linear regression fit of the form y = a + bx, + cx, for the following data: 


inthe Mme) 
pefotifof if 2f 72a 


Compute the coefficients, the standard error of estimate, and the correlation coefficient. 


Use multiple linear regression fit of the form y = a + bx, + cx, for the following data: 


Ponta nines 
ry [3] st7 [12 [21 1s [13 [26 | 27 | 24 | 


Compute the coefficients, the standard error of estimate, and the correlation coefficient. 
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P7.84 


P7.85 


P7.86 


PiS7 


P7.88 


P7.89 


P7.90 


Use multiple linear regression fit of the form y = a + bx, + cx, for the following data: 
Lx) 0| Oo/at | [2 PO eel 
| 0 | oy 1 ee 
ya) 23 ns | 19 | Oe ess] 0 sa von eae 


Compute the coefficients, the standard error of estimate, and the correlation coefficient. 


Use multiple linear regression fit of the form y = a + bx, + cx, for the following data: 


eco" 0 [ai] 2 [so eine 
egg Os oe ee 


Compute the coefficients, the standard error of estimate, and the correlation coefficient. 


For the data given in Table P7.86, find the equation to the best fitting exponential curve of the 
form y = ae”. 


Table P7.86 


For the data given in Table P7.87, find the equation to the best fitting exponential curve of the 
form y = ae™. 


For the data given in Table P7.88, find the equation to the best fitting exponential curve of the 
form y = ae”. 


Table P7.88 


[x | 2.2] 3] 4] 6] 7 
fy | 31 [38] 45 [68 | 84 


For the data given in Table P7.89, find the equation to the best fitting exponential curve of the 
form y = ab‘. 


Table P7.89 
PaPARIEIE ies 


For the data given in Table P7.90, find the equation to the best fitting exponential curve of the 
form y = ab’. 


Table P7.90 


x ese) sali] 
by ts 03 [32 [57 Por 
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P7.91 For the data given in Table P7.91, find the equation to the best fitting exponential curve of the 
form y = ab’. 


Table P7.91 
JAS) 
eal 2 arsifor 


P7.92 For the data given in Table P7.92, find the equation to the best fitting exponential curve of the 
form y =x = b. 


P7.93 For the data given in Table P7.93, find the equation to the best fitting exponential curve of the 
form y = xy = b. 


P7.94 For the data given in Table P7.94, find the equation to the best fitting exponential curve of the 
form y =xy' = b. 


Table P7.94 


|x | 232 | 178 | 99 | 66 | Si_| 


Lyf it [13 [18 | 2.2] 2.5 | 


P7.95 Find a non-linear relationship of the form y = a + b log x for the data given in Table P7.95. Determine 
the linear correlation coefficient. 


Table P7.95 


fy {06[ 5.1] 69] 10_| 


P7.96 Fit the following data to a straight line v = a + bx by considering the weights as given in the table. 
Compute the sum of squares of errors. 


ceo en ens] a | 7a 9a 
Py [21] 83 [90 | 60 | 45 | 38 [74 | 66 | 59 | 32 | 
es 
P7.97 Fit the following data to a straight line y = a + bx by considering the weights as given in the table. 
Compute the sum of squares of errors. 

Px fis T asf asfaifso fiat apis) sot 0 
Py {50 [134 [ 107 [ 95 [90 | 44 | 120 [ 47 | 110 | 120 | 

(0) ee a 
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P7.98 


7.09 


P7.100 


P7.101 


P7.102 


P7.103 


P7.104 


Fit the following data to a straight line y = a + bx by considering the weights as given in the table. 
Compute the sum of squares of errors. 


Fit the following data to a straight line y = a + bx by considering the weights given in the table. 
Compute the sum of squares of errors. 


ME Ei iE 


Fit the following data to a straight line y = a + bx by considering the weights given in the table. 
Compute the sum of squares of errors. 


fy | 66 | 66 | 108} 106 | 160 _| 165 | 176 | 227 | 234 | 279 | 
se 2) | ne a es [S| 


Fit the following data to a straight line y = a + bx by considering the weights given in the table. 
Compute the sum of squares of errors. 


Px | a7 | 22a aia 094 
Ly [21 | 83 | 90 | 60 | 45 | 38 | 74 | 66 | 59 | 32 | 
es [ A | 2 2 i | ea 


Fit the following data to a straight line y = a + bx by considering the weights given in the table. 
Compute the sum of squares of errors. 


Ly | 21 | 83 | 90 | 60 | 45 | 38 | 
rnin aims 


Fit the following data to a straight line y = a + bx by considering the weights given in the table. 
Compute the sum of squares of errors. 


[x | 16] 45] 35 [31] 30] 
Ly | 50 | 134 | 107 | 95 | 90 | 
| a a 


Fit the following data to a straight line y = a + bx by considering the weights given in the table. 
Compute the sum of squares of errors. 
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P7.105 Fit the following data to a straight line y = a + bx by considering the weights given in the table. 
Compute the sum of squares of errors. 


[x | 35 [50] 22 | 40) 
Ty Po yisl efi 
eae! 3] 43) 


P7.106 Fit the following data to a straight line y =a + bx by considering the weights given in the table. 
Compute the sum of squares of errors. 


[x [io] 10] 137 137 18] 
pe pesilios [108 {106 | 160 
ov sal 2 | 3 


P7.107 Fit the following data to a straight line y = a + bx by considering the weights given in the table. 
Compute the sum of squares of errors. 


P7.108 Construct a least-squares quadratic approximation to the function y = e’ on [0, 1}. 
P7.109 Construct a least-squares quadratic approximation to the function y =x In x on [1, 3]. 


P7.110 Construct a least-squares quadratic approximation to the function y =x‘ on [0, 2]. 


] 
P7.111 Construct a least-squares quadratic approximation to the function y= — on[1. 3}. 
af 


P7.112 Construct a least-squares quadratic approximation to the function y =. + 3x + 2 on [0, 1]. 

P7.113 Use the Gram-Schmidt orthogonalization process to construct (+). 0,(v). 0,(¥), and 0,(x) for the 
interval [0, 1]. 

P7.114 Use the Gram-Schmidt orthogonalization process to construct ,(¥). 0,(¥). 0,(x). and ©,(\) for the 
interval [0, 2]. 


P7.115 Use the Gram-Schmidt orthogonalization process to construct ,(x). 0,(%), 0,(¥), and o,(x) for the 
interval [1, 3]. 


REVIEW QUESTIONS 


Explain the words simple and linear as used in simple linear regression. 

What is meant by independent and dependent variables in a regression model? 

What is the difference between exact and non-exact relationships between two variables? 
What is the difference between linear and nonlinear relationships between two variables” 
Explain the difference between a simple and multiple regression model. 

Explain the difference between a deterministic and a probability regression model. 

What is the reason for the inclusion of the random error term in a regression model? 


Describe the least-squares method and least squares regression line. 


eo Se YS 
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9. 
10. 
a 


12: 


13; 


14. 
£3; 
16. 
ie 
18. 


19, 
20. 


Zl, 


22. 
23. 


24. 


26. 


Describe the meaning and concept of SSE. 


What is the difference between y and yj? 

If two variables x and y have a positive linear relationship, what happens to the value of y as x 
increases? 

If two variables x and y have a negative linear relationship, what happens to the value of y when x 
increases? 

Describe the following terms: 

(a) population regression line 

(b) true values of A and B 

(c) sample regression line 

(d) estimated values of A and B that are denoted by a and b, respectively. 

Describe the assumptions made in a population regression model. 

What are the degrees of freedom for a simple linear regression model? 

Describe the meaning of coefficient of determination. 

Explain the meaning of SST and SSR. 


Describe the mean, standard deviation, and shape of the sampling distribution of the slope b of 
the simple linear regression model. 


Explain the meaning of a linear correlation coefficient between two variables. 
Describe the following terms: 

(a) no linear correlation 

(b) weak positive linear correlation 

(c) weak negative linear correlation 

(d) strong negative linear correlation 

(e) strong positive linear correlation 

(f) perfect positive linear correlation 

(g) perfect negative linear correlation 


What kind of plot is useful for deciding whether finding a regression line for a set of data points 
in reasonable? 


Identify one use of a regression equation. 

Regarding the variables in a regression analysis, 

(a) what is the independent variable called? 

(b) what is the dependent variable called? 

In the context of regression analysis, what is an 

(a) outlier? 

(b) influential observation? 

Identify the use of the coefficient of determination as a descriptive measure. 
For each of the sum of squares in regression, state its name and what it measures. 
(a) SST 

(b) SSR 

(c) SSE 
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STATE TRUE OR FALSE 
The graph of a linear equation with one independent variable is a line. (True/False) 
The graph of a linear equation slopes upward unless the slope is 0. (True/False) 
The value of the y-intercept has no effect on the direction that the graph of a linear equation 
slopes. (True/False) 
A simple regression is a regression model that contains both a and b. (True/False) 
The relationship between independent and dependent variable represented by the simple linear 
regression is that of a curve. (True/False) 
A deterministic regression model is a model that does not contain the random error term. 
(True/False) 
A probabilistic regression model is a model that does not contain the random error term. 
(True/False) 
The least-squares regression line minimizes the sum of errors. (True/False) 
The degrees of freedom for a simple regression model are n — 2. (True/False) 
The coefficient of determination gives the proportion of total squared errors (SST) that is explained 
by the use of the regression model. (True/False) 
The linear coefficient measures the strength of the linear association between two variables. 
(True/False) 
The value of the coefficient of determination is always in the range —I to +1. (True/False ) 
The value of the correlation coefficient is always in the range —1 to +1. (True/False) 
The symbol used for the linear correlation coefficient is 7”. (True/False) 
A value of r close to +1 indicates that there is a weak linear relationship between the variables. 
(True/False) 
A value of r close to zero indicates that there is a either no linear relationship between the variables 
or a weak one. (True/False) 
If y tends to increase linearly as x increases, the variables are positively linearly correlated. 
(True/False) 
If y tends to decrease linearly as x increases, the variables are positively linearly correlated. 
(True/False) 
If there is no linear relationship between x and y, the variables are linearly uncorrelated. 
(True/False) 
If there is a very strong positive correlation between two variables, a casual relationship exists 
between the two variables. (True/False) 
The independent variable in a linear relationship y = a + bx is x. (True/False) 
The dependent variable in a linear relationship y = a + bx is x. (True/False) 
The slope in a linear relationship y = a + bx is b. (True/False) 
The y-intercept of a linear relationship y = a + bx is a. (True/False) 
The y-intercept of a line has no effect on the steepness of the line. (True/False) 
(True/False) 


A horizontal line has no slope. 
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Zi: 


28. 


20 


34. 


Sink 


If a line has a positive slope, y-values on the line decrease as the x-value decrease. (True/False) 


Based on the least-squares criterion, the line that best fits a set of data points is the one having 


smallest possible sum of squared errors. (True/False) 
The line that best fits a set of data points according to the least-squares criterion is called the 
regression line. (True/False) 
Using a regression equation to make predictions for values of the predictor variable outside the 
range of the observed values of the predictor variable is called extrapolation. (True/False) 
One use of the linear correlation coefficient is as a descriptive measure of the strength of the 
linear relationship between two variables. (True/False) 
A positive linear relationship between two variables means that one variable tends to increase 
linearly as the other decrease. (True/False) 
A value of r close to —1 suggests a strong positive linear relationship between the variables. 
(True/False) 
A value of r close to 0 suggests at most a weak linear relationship between the variables. 
(True/False) 
A strong correlation between two variables doesn’t necessarily mean they are casually related. 
(True/False) 


ANSWERS TO STATE TRUE OR FALSE 


1. True 2. False 3.True 4. True 5.False 6. True /7. False 8. False 9. True 10. True 
11. True 12. True 13. True 14. False 15. False 16. True 17. True 18. False 19. True 20. True 
21. True 22. False 23. True 24. True 25. True 26. False 27. True 28. True 29. True 30. True 
31. True 32. False 33. False 34. True 35. True 


O00 


CHAPTER 


Chi-square Tests 


81 INTRODUCTION 


In Chapter 6, the tests of hypotheses about the mean, the difference between two means, and proportion, 
and the difference between two proportions were presented. In this chapter, we will describe three types of 
tests: 

1. Tests of hypotheses for experiments with more than two categories, called goodness-of-fit tests. 

2. Tests of hypotheses about contingency table, called independence and homogeneity tests. 

3. Tests of hypotheses about the variance and standard deviation of a single population. 

All these tests are performed by using the chi-square distribution. The description of chi-square 
distribution was presented in Chapter 4, Section 4.10. 


8.2 AGOODNESS-OF-FIT TEST | | . 


In this section, we describe how to make tests of hypotheses about experiments with more than two possible 
outcomes or categories. These experiments are called multinomial experiments and possess certain 
characteristics as described below. 

An experiment with the following characteristics is called a multinomial experiment. 


(a) The experiment consists of n identical trials (repetitions). 

(b) Each trial results in one of k possible outcomes or categories where k > 2. 
(c) The trials are all independent. 

(d) The probabilities of the various outcomes remain constant for each trial. 


An experiment of many rolls of a die is an example of a multinomial experiment. The experiment consists 
of several identical rolls (trials) and each roll (trial) results in one of the six possible outcomes. Each roll is 
considered independent of the other rolls and the probabilities of the six outcomes remain constant for 
each roll. A binomial experiment is a special case of a multinomial experiment. 


Observed and Expected Frequencies 

The frequencies obtained from the performance of an experiment are called the observed frequencies and 
denoted by O. The expected frequencies, denoted by E, are the frequencies that we will expect to obtain if 
the null hypothesis is true. The expected frequency for a category is obtained as 
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Esp (8.1) 
where n = the sample size 
p = the probability that an element belongs to that category if the null hypothesis is 
true. 


Degrees of Freedom for a Goodness-of-Fit Test 
In a goodness-of-fit test, the degrees of freedom are given by 


df = k-1 (8.2) 
where k = the number of possible outcomes or categories for an experiment. 
Note that the chi-square goodness-of-fit test is always a right tailed test. 


The Statistic for a Goodness-of-Fit Test 


The test statistic for a goodness-of-fit test is x? and its value is computed from 


7 = yo | (8.3) 


where O = observed frequency for a category 
E = expected frequency for a category = np. 


To make a goodness-of-fit test, the sample size must be large enough so that the expected frequency 
for each category is at least 5. Table 8.1 gives a step-by-step method for conducting a chi-square goodness- 
of-fit test. 


Table 8.1: chi-square goodness-of-fit test 


Purpose: To perform a hypothesis test for the distribution of a variable 


Assumptions: 
Le All expected frequencies are | or greater 
2 At most 20% of the expected frequencies are less than 5 
o: Simple random sample 


Step 1: The null and alternative hypotheses are 
Hy: The variable has the specified distribution. 
H;: The variable does not have the specified distribution. 


Step 2: Calculate the expected frequency for each possible value of the variable by using the formula 


E = np, where n is the sample size and p is the relative frequency (or probability) given for the 
value in the null hypothesis. 


Step 3: Determine whether the expected frequencies satisfy Assumptions | and 2. If they do not, this 
procedure should not be used. 


Step 4: Decide on the significance level,a 
Step 5: Compute the value of the test statistic 
x? =LX(O- EV/E 


where O and E represent observed and expected frequencies, respectively. Denote the value of 
the test statistic y”. 
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Critical-value Approach 


Step 6: The critical value is X,, with df = k — 1, where k is the number of possible values for the 


variable. Use table in Appendix-F to find the critical value. 
Do not reject Hp Reject Hy 


eg ee 


Step 7: If the value of the test statistic falls in the rejection region, reject Ho; otherwise, do not reject Ao. 


Step 8: Interpret the results of the hypothesis test. 


Example E8.1 


A home mail order company likes to determine if the number of orders received on each day of the five 
working days of the week is the same. Table E8.1 lists the frequency distribution for these orders received 
from a sample of 400 orders during a four-week period. 


Table E8.1 


Day of the week Monday | Tuesday | Wednesday | Thursda 
Number of orders received | 90 | 69 | 67 88 86 
Test at the 5% significance level if the null hypothesis that the orders are evenly distributed overall 
days of the week is true. 


SOLUTION: 
H, : orders are evenly distributed overall days of the week. 
H, : orders are not evenly distributed overall days of the week. 
df = k-1=5-1=4 
For a = 0.05 and df = 4, the critical value of x? = 9.488 [From the chi-square distribution table in 
Appendix-F]. 


[Day 0 |p [E=np [0-5 


Monday 80 10 
Tuesday 80 
Wednesday 80 
Thursday 80 
Frida 80 


86 | 0. 
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The values of the test statistic is 
»_w(O-E) 
=e = 


Since 6.125 < 9.488, do not reject H,. Hence, we conclude that orders are evenly distributed over all days of 
the week. 


Example E8.2 


Table E8.2 presents the frequency distribution of automobiles sold at an automobile dealership during the 
past 12 months. 


Table E8.2 


Month ; 
16 17 DE 


Automobiles sold 


Can we reject the null hypothesis that the number of automobiles sold at this automobile dealership is 
the same for each month at the 10% significance level? 


SOLUTION: 
H, : The number of automobiles sold is the same for each month. 
H, : The number of automobiles sold is not the same for each month. 
Gg = k=l =12-1 11 
From the chi-square distribution table in Appendix-F, the critical value of x° for df = 11 and 0.10 area in the 
right tail is 17.275 [From the chi-square distribution Table in Appendix-F]. 


The number of automobiles sold will be the same for each month if 1/12" of the automobiles sold during the 
full year are sold each month. Table E8.2(a) shows the computations. 


Table E8.2(a) 


Month 0} 2 E=np_[ 0-6 | (0B) | O-EyiE 
January , ; 
February 
March 
April 
May 
June 
July 
August 
September 
October 
November 
December 


HAS] 1303085 |__ 6533 | 


Hence, the value of the test statistic is 


) O-E) 
a= > = 20,3686 


Reject the null hypothesis. The number of automobiles sold is not the same for each month. 


8.3 CONTINGENCY TABLE. | Stas 


When information on more than one variable for each element is summarised and presented in a two-way 
classification table, such a table is called contingency table or cross-tabulation. Table 8.2 is an example of 
a contingency table which contains the results obtained for four samples of fixed sizes selected from four 
populations. 


Table 8.2 


Sample selected from 
Population 2 Population 3 


Population I 
25 


Population 4 
60 123 
90 73 


106 94 

A contingency table can be of any size. In general, an R x C table contains R rows and C columns. 
Each of the twelve boxes that contain the numbers in Table 8.2 is called a ce//. In Table 8.2, the number of 
cells = R x C=3 x 4 = 12 cells. The numbers written inside the cells are generally called the joint frequencies. 


8.4 A TEST OF INDEPENDENCE OR HOMOGENEITY , 


The tests of independence or homogeneity are performed using the contingency table described in 
section 8.3. The procedure to make the tests is very similar to the one applied in for a goodness-of-fit tests. 


A test of independence involves a test of the null hypothesis that two attributes of a population are 
not related. The degrees of freedom for a test of independence are given by 
df = (R-1)(C-)) (8.4) 
where R and C are the number of rows and the number of columns, respectively. in a given contingency 
table. 


Test Statistic for a Test of Independence: 
The value of the test statistic x? for a test of independence is obtained from 


2 
o yo (8.5) 


where O and E are the observed and expected frequencies, respectively, for a cell. 

The null hypothesis in a test of independence is always that the two attributes are not related. The 
alternative hypothesis is that the two attributes are related. The frequencies obtained from the performance 
of an experiment for a contingency table are called the observed frequencies. The ig to — 
the expected frequencies for a contingency table for a test of independence is slightly different from the 


one for a goodness-of-fit test. 
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Expected Frequencies for a Test of Independence 


row total) (column total 
The expected frequency E for a cell is calculated from E = com total) (Pole) 
sample size 


A step-by-step method for conducting a chi-square independence test by using the critical value 
approach is given in Table 8.3. 


Table 8.3: chi-square independence test 


Purpose: To perform a hypothesis test to decide whether two variables are associated 


Assumptions: 
lt: All expected frequencies are 1 or greater 
2. At most 20% of the expected frequencies are less than 5 
a) Simple random sample 


Step 1: The null and alternative hypotheses are 
Ho: The two variables are not associated. 
H,: The two variables are associated. 


Step 2: Calculate the expected frequencies by using the formula E = RC/n, where R = row total, 
C = column total, and n = sample size. Place each expected frequency below its corresponding 
observed frequency in the contingency table. 

Step 3: Determine whether the expected frequencies satisfy Assumptions | and 2. If they do not, this 
procedure should not be used. 


Step 4: Decide on the significance level,a 


Step 5: Compute the value of the test statistic 

x? =2(O-EY/E 
where O and E represent observed and expected frequencies, respectively. Denote the value of 
the test statistic x7. 


Critical-value Approach 
Step 6: The critical value is x. with df = (R — 1) x (C — 1), where R and C are the number of possible 
values for the two variables. Use table in Appendix-F to find the critical value. 


Do not reject Ho Reject Ho 


| 
| 
( 
| 
1 
! 
| 
' 
! 
! 
I 


0 iE 


Step 7: If the value of the test statistic falls in the rejection region, reject Hy; otherwise, do not reject Ho. 
Step 8: Interpret the results of the hypothesis test. 
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Example E8.3 
—— eee 
Table E8.3 shows the results of a random survey of 100 people who were checked by police to see if they 


were wearing seat belts. Test at the 2.5% significance level if being a male or a female and wearing or not 
wearing a seat belt are related. 


Table E8.3 


Wearing seat belt 
35 
30 


Not wearing seat belt 


Male 
Female 


SOLUTION: 
Hi, : Gender and wearing or not wearing of seat belt are not related. 
H,, : Gender and wearing or not wearing of seat belt are related. 
poke 1) -2_-)e@-l)=1 
Table E8.3(a) 


|__| Wearing a seat belt 
Men SONS5-/ 2) 20 (19.25) 55 
Women 30 (29.25) 1S. (15:75) 45 


Li ai ae ci 


The numbers in parentheses are expected frequencies, which are calculated using the formula 


Total 


_ (row total) (column total) 
total of both samples 


The values of the test statistic is 


O=EY 
‘ae yo = 0.0157 + 0.0292 + 0.0192 + 0.0357 = 0.0999 


Since 0.0999 < 5.024 [From the chi-square distribution table in Appendix-F], do not reject the null hypothesis. 
Being a male or female and wearing or not wearing a seat belt are ot related. 


Example E8.4 


A study conducted to find the relationship between smoking and household income (Rupees per month) 
for males aged 15 years and older in a large city in India gave the two-way distribution of the observed 
frequencies based on household incomes and smoking status as shown in Table E8.4. 


Table E8.4 


Household income | Current smokers | Former smokers | Never smokers 
(in Rupees) 


Less than 5,000 


5,000 to 6,999 
7,000 to 9,999 
10,000 to 14,999 
15,000 or higher 
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Using the 5% significance level, can we calculate that household incomes and smoking status are 
dependent for people living in that city? 


SOLUTION: 
H, : Household income and smoking status are independent. 
H, : Household income and smoking status are dependent. 
df = (R-)(C-YD=6-HDG-H=8 
From the chi-square distribution table in Appendix-F, the critical value of x° for df = 8 and 0.05 area in the 
right tail is 15.507. 


The value of the test statistic is 


We oS = 0.758 + 5.122 + 0.45 + 1.567 + 0.003 + 3.215 + 0.958 + 0.842 + 1.38 + 
1.756 + 2.091 + 5.227 + 0.099 + 0.482 + 0.075 = 23.369 


Reject the null hypothesis. Household income and smoking status are dependent for people living in that 
city. 


Table E8.4(a) shows the detailed computations. The numbers in parentheses are expected frequencies, which 


are calculated using the formula — = eovaiou(colunniees 


total of both samples 


Table E8.4(a) 


Household income | Current smokers | Former smokers 


< 5000 360 (376.901) 2OCI2 12) 190 (180.979) 
5000-6999 500 (422.779) 15 (15.204) 200 (227.017) 
7000-9999 650 (625.523) 16 (20.116) 280 (300.361) 
10000-14999 460 (489.31) 10\(15;736) 270 (234.955) 


2 15000 300 (305.488) 12 (9.824) 150 (146.688) 


Example E8.5 


Two brands of drugs were experimented on two groups of randomly selected patients for a particular illness. 
Table E8.5 shows the information about the number of patients who cured and not cured by the 
administration of the two brands of drugs. 


Test at the 1% significance whether or not the two brands are similar in curing and not curing the 
patients. 
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SOLUTION: 
H,, : The two brands are similar in curing the patients. 
H, : The two brands are not similar in curing the patients. 
df = (R-1) (C-)l=2-N2-1=!1 


From the chi-square distribution table in Appendix-F, the critical value of x° for df = | and 0.01 area in the 
right tail is 6.635. 


Table E8.5(a) 


Cured Not cured Total 
Brand B 17 (24.8) ZyCS:2) 40 
62 38 100 


The numbers in parentheses are expected frequencies, which are calculated using the formula: 


Ez (row total) (column total) 
(total of both samples) 


Pas O-E) 
The value of the test statistic is x’ = ©} oer = 1.6355 + 2.6684 + 2.4532 + 4.0026 = 10.7598 


Reject the null hypothesis. The two drug brands are not similar in curing the patients. 


A Test of Homogeneity 

A test of homogeneity involves testing the null hypothesis that the proportions of elements with certain 
characteristics in two or more different populations are the same against the alternative hypothesis that 
these proportions are not the same. 

The procedure to make a test of homogeneity is very similar to the procedure used to make a test of 
independence discussed in earlier section. A test of homogeneity is also right tailed like the test of 
independence. 


Example E8.6 


ne er eee es ————EoEE———EEEEEEEEEEEEEE—EEEE—E—_——== 
A survey of 500 adults (250 men and 250 women) conducted in a particular city in India gave their opinions 
on weather forecasts and the greenhouse effect as given in the two-way classification table below: 


Table E8.6 


Men | Women 


Have been big changes 
Have not been 

Only small changes 

Do not know (no opinion) 
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Is there sufficient evidence from the data to reject the null hypothesis that men and women have similar 
opinions about the greenhouse effect at 1% significance level? 


SOLUTION: 
H, : Men and women have similar opinions about the greenhouse effect. 
H, : Men and women do not have similar opinions about the greenhouse effect. 
df = (R-1)(C-1=4-1@-)=3 
For a = 0.01 and df = 3, the critical value of y° is 6.251 [From the chi-square distribution table in 
Appendix-F’]. 
Table E8.6(a) 


Have been big changes 85 (100) | 115 (100) 200 
Have not been 110 (90) 70 (90) 180 
Only small changes 40 (35) 30 (35) 70 
Do not know (no opinion) 15 (25) 35025) 50 


The numbers in parentheses are expected frequencies, which are calculated using the formula: 


(row total) (column total) 


(total of both samples) 


The value of the test statistic is 


: (C=By- 
x? = ao 2.25 + 2.25 + 4.4444 + 4.4444 + 0.7143 +0.7143 +444 


= 22.8175 


Since 22.8175 > 6.251, reject the null hypothesis. Men and women do not have similar opinions about the 
greenhouse effect. 


Example E8.7 


A random sample of 100 people selected were asked whether or not they support a certain form of subsidy 
program from each of the four regions of the United States. Table E8.7 shows the summary of the results 
obtained. Using the 1% significance level, test the null hypothesis that the percentages of people with 
different opinions are similar for all four regions. 


Table E8.7 


[ [Favour [ Oppose | Uncertain 


Northeast 55 =P) 13 
75 PAL 4 
65 28 7 
60 34 6 


Midwest 
South 
West 
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SOLUTION: 
H, : The percentage of people with different opinions are similar for all four regions. 
H, : The percentage of people with different opinions are not similar for all four regions. 
df = (R-1)(C—1)=(4-1) 3-1)=6 
From the chi-square distribution table in Appendix-F, the critical value of 7° for df = 6 and 0.01 area in the 


right tail is 16.812. 
Table E8.7(a) 


ose | Uncertain Total 
52028:75): | 1355) 


21 (28.75) | 4(7.5) | 100 
| 


Favour 

Northeast | 55 (63.75) 
Midwest 3) (03.75) 
South 65 (63.75) 


Zolzes) | 7(2-5) 


West 60 (63.75) |_34 (28.75) | 6(7.5) | 100 
34 (28.75) | 67 
Total 15 —«([30~——«*|:—=«4000 


The numbers in parentheses are expected frequencies, which are calculated using the formula: 


_ (row total) (column total) 


(total of both samples) 


The value of the test statistic is 


=e 
c= Ce = 1.201 + 0.3674 + 4.0333 + 1.9853 + 2.0891 + 1.6333 + 0.0245 + 0.0196 


+ 0.333 + 0.2206 + 0.9587 + 0.3 = 12.8662 


Do not reject the null hypothesis. The percentages of people with different opinions are similar for all four 
regions. 


8.5 INFERENCES ABOUT THE POPULATION VARIANCE 


The sample variance is a random variable and it possesses a sampling distribution. When all the possible 
samples of a given size are taken from a population and their variances are calculated. the probability 
distribution of these variances is called the sampling distribution of the sample variance. 

If the population from which the sample is selected is approximately normally distributed. then 
(n — 1)s?/o? has a chi-square distribution with n — 1 degrees of freedom. 

Hence, the chi-square distribution can be used to construct a confidence interval and test a hypothesis 


about the population variance 0”. 


8.6 ESTIMATION OF THE POPULATION VARIANCE 


Considering that the population from which the sample is selected is approximately normally distributed. 
the (1 — «)100% confidence interval for the population variance s’ is given by 


(n- 1)s? (n- 1)s” (8.7) 


LO ar 


2 2 
Xa/2 X1-a/2 
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where %.2 and Xia are obtained from the chi-square distribution table in Appendix-F for a/2 and 
1 — o/2 are as in the right tail of the chi-square distribution curve, respectively, and for n — | degrees of 
freedom. The confidence interval for the population standard deviation can be obtained by simply taking 


the positive square root of the two limits of the confidence interval for the population variance. 


Example E8.8 and E8.9 illustrate the estimation of the population variance and population standard 


deviation. 


Example E8.8 


Pe _ al 


Particular machine parts manufactured by a company must have a variance of 0.0025 square millimeters or 


less. A random sample of 29 such machine parts gave a variance of 0.005 square millimeters. 


(a) 


(b) 


test at the 1% significance level if the variance of all such machine parts is greater than 
0.0025 sq mm. Assume that all the machine parts manufactured by the company are approximately 


normally distributed. 
make the 98% confidence intervals for the population variance and standard deviation. 


SOLUTION: 


(a) 


(b) 


ar = 0.0025 
H, : o 20.0025 
Area in the right tail = 7 = 0.01 
df =wiaab—29.-— heaves 


x tor df = 28 and 0.01 area in the right tail is 48.278 [From the chi-square distribution table in 


Appendix-F]. 
The value of the test statistic is 


ne (n- De _ (29-1)(0.005) _ sf 
(9) 0.0025 
Reject the null hypothesis. The population variance is greater than 0.0025 sq mm. 
Tp = 0.01 and 1 - se 1-0.01 =0.99 
df = n-1=29-1=28 
x° for 28 df and 0.01 area in the right tail = 48.278 
x° for 28 df and 0.99 area in the right tail = 13.565 
The 98% confidence interval for the population variance 6? is 


(n-1)s*  (n—1)s” _ (29=1)(0.005)  (29-1)(0.005) 


to = - 
Xora cae 48.278 13.565 = 000289987 to 0.0103206 


The 98% confidence interval for o is 


V0.00289987 to V0.0103206 = 0.05385 to 0.10159 
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Example E8.9 
i 


The manufacturer of a certain brand of light bulbs claims that the variance of the lives of these bulbs is 
4500 square hours. An independent agency took a random sample of 25 such bulbs and tested them. The 
variance of the lives of these bulbs was found to be 5000 square hours. Assume that the lives of all such 
bulbs are approximately normally distributed. 


(a) make a 99% confidence intervals for the variance and standard deviation of the lives of all such 
bulbs. 


SOLUTION: 
(a) df=n—1=25-1=24 
o/2 = 0.5 — (0.99/2) = 0.005 and 1 — o/2 = 1 — 0.005 = 0.995 
From the chi-square distribution table in Appendix-F,, we have 
x for 24 df and 0.005 area in the right tail = 45.559 
x? for 24 df and 0.995 area in the right tail = 9.886 
The 99% confidence interval for the population variance o? is 


aa Ng =h)s7 25 —1)(5000 - 
Xo/2 


Da 45.559 9.886 
The 99% confidence interval for o is 


2633.9472 to ¥12138.3775 = 51.3220 to 110.1743 


8.7. HYPOTHESIS TESTS ABOUT THE POPULATION VARIANCE 


The value of the test statistic x? is calculated as 


oS (8.8) 
fe) 
where s? =the sample variance 
co? =the hypothesised value of the population variance 
n—1= the degrees of freedom 
The population from which the sample is selected is assumed to be approximately normally distributed. 
A test of hypothesis about the population variance can be one-tailed or two-tailed. 


Examples E8.10 and E8.11 illustrate the procedure to make tests of hypotheses about o°. 


Example E8.10 

The variance of the SAT scores for all students who took the test this year is 5100. The variance of the 
SAT scores for a random sample of 20 students from one school is equal to 3220. Test at the 2.5% significance 
level if the variance of the SAT scores for students from this school is lower than 5100. Assume that the 


SAT scores for all students at this school are approximately normally distributed. 
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SOLUTION: 
H,: © =5100 
H, : 0° <5100 
Area in the left tail = a = 0.025 
Area in the right tail = 1 — @ = 1 — 0.025 = 0.975 
df=n—1=20-1=19 
x? for df= 19 and 0.975 area in the right tail = 8.907 
[From the chi-square distribution table in Appendix-F). 
The value of the test statistic is 
> (n-l)s* _ (20-1)(3220) 
Ret 5100 
Do not reject the null hypothesis. The population variance of the SAT scores for students from the given 
school is not lower than 5100. 


= 11.996 


Example E8.11 
A random sample of 25 students taken from a particular university gave the variance of their GPAs equal to 
0.20. 


(a) construct the 99% confidence intervals for the population variance and standard deviation. Assume 
that the GPAs of all students are approximately normally distributed. 


(b) the variance of GPAs of all students at this university was 0.12 two years ago. Test at the 1% 
significance level if the variance of GPAs now is different from 0.12. 
SOLUTION: 
(a) o/2 = 0.5 — (0.99/2) = 0.005 and 1 — o/2 = 1 — 0.005 = 0.995 
df=n-1=25-1=24 
From the chi-square distribution table in Appendix-F, we have 


x for 24 df and 0.005 area in the right tail = 45.559 and yx? for 24 df and 0.995 area in the 
right tail = 9.886. 


The 99% confidence interval for the population variance 6? is 
(n-l)s° miGts I)s* _ @5-N0.2) (25 — 1)(0.20) 
Les ieee 45.559 9.886 
The 99% confidence interval for o is 
0.10536 to V0.48554 = 0.3246 to 0.6968 
(b) H,:0° =0.12 
H,:0° #0.12 
o/2 = 0.01/2 = 0.005 and 1 — o/2 = 1 — 0.005 = 0.995 
df=n-1=25-1=24 


= 0.10536 to 0.48554 
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From the chi-square distribution table in Appendix-F, we have 


X° for 24 df and 0.005 area in the right tail = 45.559 and X° for 24 df and 0.995 area in the right tail = 9.886. 
The value of the test statistic is 


ye a =1)se (25 =1)(0.2). _ 


Oo” 0.12 
Since 40 is between 9.886 and 45.559, do not reject H,. The variance of the GPAs is still equal to 0.12. 


PROBLEMS 


A Goodness-to-Fit Test 


P8.1 A randomly selected sample of 100 people who suffer from allergies were asked during what season 
they suffer the most. Table P8.1 gives the results of this survey. 


Table P8.1 


Season Fall | Winter | Spring | Summer 


People allergic | 19 11 33 37 


Using 1% significance level, test the null hypothesis that the proportions of all allergic people are 
equally distributed over the four seasons. 


P8.2 A roulette wheel contains 18 red numbers, 18 black numbers and 2 green numbers. Table P8.2 
shows the frequency with which the ball landed on each colour in 200 trials. 


Table P8.2 


| Number __| Red | Black | Green | 


At the 5% significance level, do the data suggest that the wheel is out of balance? 


P8.3 =‘ Table P8.3 lists the age distribution for a sample of 100 people arrested for drunk driving in the 
year 2007 in a small city in US. Using the 1% significance level, can we reject the null hypothesis 
that the proportion of people arrested for drunk driving is the same for all age groups? 


Table P8.3 


Age_ 18-25 | 26-35 | 36-45 | 46-55 | 56 and older 
Arrests | 31 | 26 20 2)| 715 8 


P8.4 Table P8.4 shows the results of a survey in 2008 of married woman aged 45 and over when they 
were asked who makes the financial decisions in their households. In a recent sample of 800 married 
woman aged 45 and over, 136 indicated that in their households they make the financial decisions 
themselves, 248 said that their husbands make such decisions, and 416 stated that such decisions 
are made by both together. Test at the 2.5% significance level whether the current pattern of 


financial decision-making is different from that for 2008. 


396 // Probability and Statistics for Scientists and Engineers // 


Table P8.4 
Decision made b Wife | Husband | Both 


Percentage of households 


P8.5 Table P8.5 shows the data obtained when the die is rolled 150 times in order to test whether the die 
is loaded or not (that is, to check whether the six numbers are equally likely or not). 
Table P8.5 


Number Pal os) eee as 
Frequency || 23 | 26 | 23 | 21 | 31 | 26 


Test at the 0.05 level of significance. do the data provide sufficient evidence to conclude that the 
die is loaded? 


Contingency Tables 


P8.6 A random sample of 300 adults was selected and they were asked if they favour giving more freedom 
to elementary school teachers to approximately punish students for lack of discipline. The results 
are summarised in Table P8.6. 

Table P8.6 


Sisk 3 
(F) (A) (N) 

34 6 

Find the expected frequencies for this Table P8.6 assuming that the two attributes, gender and 

opinions are independent on the issue. 


P8.7 —_ Refer to Problem P8.6. Use a 1% significance level to determine whether the sample provide sufficient 
information to conclude that the two attributes, gender and opinion of adults, are dependent. 


P8.8 — The results obtained as shown in Table P8.8 from a survey of college students who were interviewed 
on their drinking behaviour. Students were classified according to their academic performance at 
the college. The students were asked the question whether or not he/she had engaged in binge 
drinking (five or more drinks in a row) at least once during the past month. 


Table P8.8 


College performance 


Binge drinking 


Excellent 
Average 
Below average 


Using the 5% significance level, can we conclude that college performance and binge drinking are 
related for all college students? 


P8.9 — Table P8.9 gives a two-way classification of divorced fathers based on a legal study and compliance 


with child support court orders. At the 1% significance level, can we conclude that the legal custody 
and compliance with child support orders are related? 
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Table P8.9 
Legal custody Number of fathers who | 


Mother 
Joint/split 


P8.10 Two random samples, one of 100 administrators and a second of 50 engineers were taken from a 


large company. These employees were asked about their views on a certain company issue. Table 
P8.10 gives the results of this survey. 


Table P8.10 


| Administrators | 61 | 30 [| 9 
) 0 


Use the 2.8% significance level, test the null hypothesis that the distribution of opinions are 
homogeneous for the two groups of workers. 


A Test of Independence or Homogeneity 


P8.11 Table P8.11 gives the data on income distribution for households in Visakhapatnam and Hyderabad. 
Using the 2.5% significance level, test the null hypothesis that the distribution of households 
with regard to income levels is similar (homogeneous) for the two cities. 


Table P8.11 


|_| Visakhapatnam 


High income 


34 


Medium income 39 
Low income Wi 
[50 


P8.12 Table P8.12 shows from a sample of 500 people who held more than one job to make ends meet. 


Table P8.12 


Test at the 10% significance level if gender and marital status are related for all people who hold 
more than one job. 
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P8.13 A random sample of 100 people was selected from each of the four regions of the state of Andhra 
Pradesh. These people were asked whether or not they support a certain farm subsidy programme. 
The results of the survey are summarised in Table P8.13. 


Table P8.13 


Favour No opinion 


Northeast 


Midwest 
South 
West 


Using the 1% significance level, test the null hypothesis that the percentages of people with 
different opinions are similar for all four groups. 


P8.14 Two drugs were given to two randomly selected patients to cure the same disease. Table P8.14 
gives the data about the number of patients who were cured and not cured by each of the two 
drugs. 


Table P8.14 


|__| Cured | Not cured | 


Test at the 1% significance level whether or not the two drugs are similar in curing and not curing 
the patients. 


P8.15 Table P8.15 gives the results of a survey of two samples taken from a company workers. The workers 
were asked about their views on health care programme. Using the 2.5% significance level, test the 
null hypothesis that the distributions of opinions are homogeneous for the two groups of workers. 


Table P8.15 


Favour | Oppose | No opinion 


Male workers | 46 | 40 | 9 | 


inferences about the Population Variance 


P8.16 One type of machine component is manufactured by a company. The machine that fills packages 
of these parts is set up in a way that the average net weight of these packages is 32 gms with a 
variance of 0.015 sq.gms. Periodically, the quality control inspector selects a sample of a few such 
packages to construct a 95% confidence interval for the population variance. If either of both or 
one of the two limits of the confidence interval is not in the interval 0.008 to 0.030, the machine is 
stopped and adjusted. A random sample of 25 packages from the population gave a sample variance 
of 0.030 sq.gms. Based on this sample data, do we say that the machine needs an adjustment? 
Assume that the net weights of all parts in all packages are normally distributed. 
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P8.17_ A sample of 16 observations selected from a normally distributed population produced a sample 
variance of 1.11. Write the null and alternative hypothesis to test whether the population variance 


is greater than 0.75. For a= 0.01, find the critical value of x°. Using the 1% significance level. can 
we reject the null hypothesis? 


P8.18 A sample of 14 observations selected from a normally distributed population produced a sample 
variance of 4.7. Write the null and alternative hypothesis to test whether the population variance 
is different from 2.3. Using © = 0.05, find the critical values of x°. Using the 5% significance level, 
can we reject the null hypothesis stated in above? 


P8.19 Steel bolts for a machine manufactured by a company must have a variance of 0.0025 sq.mm or 
less for acceptance y a customer. A random sample of 29 such bolts gave a variance of 
0.0058 sq.mm. Assume that the lengths of all bolts manufactured by this company are approximately 
normally distributed. Test at the 1% significance level if the variance of all such bolts is greater 
than 0.0025 sq.mm. 


P8.20 The manufacturer of a certain brand of light bulbs claims that the variance of the lives of these 
bulbs is 4500 square hours. In a random sample of 25 such bulbs, the variance of the lives of these 
bulbs was found to be 4800 square hours. Assume that the lives of all such bulbs are approximately 
normally distributed. Test at the 5% significance level if the variance of such light bulbs is different 
from 4500 square hours. 


Estimation of the Population Variance 


P8.21 Refer to Problem P8.19. Make a 98% confidence intervals for the population variance and standard 
deviation. 


P8.22 Refer to Problem P8.20. Make the 99% confidence intervals for the variance and standard deviation 
of the lives of all such bulbs. 


P8.23 Refer to Problem P8.16. Use a = 0.01. Do we think that the machine needs an adjustment? 


P8.24 The variance scores on a standardised science test for all high school seniors was 150 in the year 
2008. A sample score for 20 high school seniors who took this test in the year 2008 gave a variance 
of 173. Test at the 5% significance level if the variance of current scores of all high school seniors 
on this test is different from 150. Assume that the scores of all high school seniors on this test are 
approximately normally distributed. 


| REVIEW QUESTIONS -— 


1. Why is the phrase “goodness-of-fit” used to describe the type of hypothesis test considered in 
this chapter? 

2. Are the observed frequencies variables? Explain your answers. 
Are the expected frequencies variables? Explain your answers. 

4. To find out whether two variables of a population are associated, we usually need to resort to 
inferential methods such as chi-square independence test. Why? 

5. Explain why a chi-square goodness-of-fit test or a chi-square independence test is always right 


tailed. 
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6. 


If the observed and expected frequencies for a chi-square goodness-of-fit test or a chi-square 
independence test matched perfectly, what would be the value of the test statistic? 

State the assumptions for a chi-square goodness-of-fit test or a chi-square independence test 
regarding the expected frequency. 

Describe the four characteristics of a multinomial experiment. 

When is a goodness-of-fit test applied? Explain. 

What is the difference between the observed and expected frequencies for a goodness-of-fit test? 

How is the expected frequency of a category computed for a goodness-of-fit test? 

Describe the test of independence. 

Describe the test of homogeneity. 

How are the expected frequencies for cells of a contingency table calculated in a test of 
independence or homogeneity? 


oe SS) sales 


nA 


STATE TRUE OR FALSE 


The random variable x? assumes only non-negative values. (True/False) 
The parameters of the chi-square distribution are df and n. (True/False) 
The trials are random is a characteristic of a multinomial experiment. (True/False) 
The observed frequencies for a goodness-of-fit test are the frequencies obtained from the 
performance of an experiment. (True/False) 
The expected frequencies for a goodness-of-fit test are the frequencies given by the product of n 
and p. (True/False) 
The degrees of freedom for a goodness-of-fit test are n + k — 1. (True/False) 
The chi-square goodness-of-fit test is always right tailed. (True/False) 
The expected frequency of each category must be at least 8 to apply a goodness-of-fit test. 
(True/False) 
The degrees of freedom for a test of independence are (R — 1) (C — 1). (True/False) 
The trials are independent in a multinomial experiment. (True/False) 


The probabilities of the various outcomes remain constant for each trial in a multinomial experiment. 
(True/False) 


The value of the test statistic x° for a test of independence is calculated as ie = py 


(True/False) 
The test of independence is a test of null hypothesis that two attributes of a population are related. 


(True/False) 


ANSWERS TO STATE TRUE OR FALSE 


1. True 2. False 3. False 4. True 5.True 6. False 7. True’ 8. False 9. True 10. True 
11. True 12. True 13. False 


CHAPTER 


Analysis of Variance 


91 INTRODUCTION 


In Chapter 5, we studied how to compare two population means, that is, the means of a single variable for 
two different populations. We reviewed various methods for making such comparisons. We described the 
procedures that are used to test hypotheses about the difference between two population means using the 
normal and f¢-distributions. The hypothesis testing procedures for the difference between two population 
proportions using the normal distribution were also described. In Chapter 6, the procedures to test 
hypotheses about the equally of more than two population proportions using the chi-square distribution 
were also presented. Analysis of variance (ANOVA) provides methods for comparing several population 
means, that is, the means of a single variable for several populations. In this chapter. we present the simplest 
kind of ANOVA, one-way analysis of variance. This type of ANOVA is called one-way analysis of variance 
because it compares the means of a variable for populations that result from a classification by one other 
variable, called the factor. The possible values of the factor are referred to as the levels of the factor. One- 
way analysis of variance is the generalisation to more than two populations of pooled /-procedure. 


An analysis of variance test is performed using the F-distribution. This chapter explains how to test 
the null hypothesis that the means of more than two populations are equal. 


9.2 ONE-WAY ANALYSIS OF VARIANCE 


ANOVA is a procedure used to test the null hypothesis that the means of three or more populations are 
equal. In this section, we discuss the one-way ANOVA procedure to make to make tests comparing the 
means of several populations. By using one-way ANOVA test, we analyse only one factor or variable. The 
procedure under discussion is called the analysis of variance because the test is based on the analysis of 
variation in the data obtained from different samples. The application of one-way ANOVA requires the 


following assumptions. 


Assumptions (conditions) for one-way ANOVA 
1. Simple random samples: The samples taken from the populations under consideration are simple 


random samples. 
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2. Independent samples: The samples taken from the populations under consideration are independent 
of one another. 
3. Normal populations: For each population, the variable under consideration is normally distributed. 
4. Equal standard deviations: The standard deviations of the variable under consideration are the 
same for all the populations. 
The reason for the word variance in analysis of variance is that the procedure for comparing the 
means analyses the variation in the sample data. 
The one-way ANOVA test is always right-tailed with the rejection region in the right-tail of the 
F-distribution curve. 


The value of the test statistic F for a test of hypothesis using ANOVA is given by the ratio of two variances, 
the variance between samples (MSB) and the variance within samples (MSW). The value of the test statistic 
F for an ANOVA test is given by 


variance between samples nat MSB (9.1) 


variance within samples MSW 


The between-samples sum of squares, denoted by SSB is calculated as 


| ees To Ve (ay 
ssp-[ By...) oe (9.2) 
The within samples sum of squares, denoted by SSW, is calculated as 
Pa ame 16 
SSW = <x (BBs (9.3) 
where n, = the size of the sample 
T. = the sum of the values in sample i 
n= the number of values in all samples = n, + n, + -:- 
Xv = the sum of the values in all samples = 7, + T, + T,+ --- 
Xx° = the sum of the squares of the values in all samples. 
The total sum of squares, denoted by SST, is calculated as 
SST = SSB +SSW (9.4) 
The MSB and MSW are calculated as MSB = — 
and MSW = _ (95) 


where k — | and n —k are, respectively, the degrees of freedom for the numerator and the degrees of freedom 
for the denominator for the F-distribution. 
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Example E9.1 


Table E9.1 gives the number of classes missed during one semester by 25 randomly selected undergraduate 
continuing education students drawn from three different groups. 


Table E9.1 
Below 25 | 25 to 30 | 31 and above 


(a) state the null and alternative hypotheses if we were to test if the mean number of classes missed 
during the semester by all students in each of these three age groups is the same 


(b) show the rejection and non-rejection regions on the F-distribution curve for @ = 0.01 
(c) find SSB, SSW and SST 

(d) find the between-samples and within-samples variances 

(e) find the calculated value of the test statistic F 

(f) show the ANOVA table for this problem. 


SOLUTION: 

(a) Aye w=p,=H, 
H.: All three population means are not equal 

(b) k=3andn=n,+n,+n,=10+9+6=25 
df for the numerator = k- 1=3-1=2 
df for the denominator = n — k = 25 — 3 = 22 
For & = 0.01 and df = (2, 22), the critical value of F is 5.72. 
The rejection region lies to the right of F = 5.72. The non-rejection region lies to the left of F = 5.72 
[From the F-distribution table in Appendix-H]. 

(c) T,=143, T, =91 and T, = 39 
Xx =T, + T,+T, = 143 +91 +39 =273 
De = 182 + 112 + 262 + 112 + 1927 + 424 16 412417 +P 41K 4+ S41 +1V +P +P 44412? + 

18° + 3+ 7459 4+9% 49 =3751 


2 2 2 es vi 32 2 2 
RET |__| 143 SEL eT 
nm nm, n, 10 9 6 


n 


ssn-| 
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143? or 


a 2 9? 
Tq EB) aay [BEE | = 532.4889 


SSW = =x" -| — 
mh nN Nn, 


9 
SST = SSB + SSW = 237.3511 + 532.4889 = 769.84 
(d) df forthe numerator =k-1=3-—1=2 
df for the denominator = n — k = 25 ~ 3 = 22 
MSB = SSB/ (k— 1) = 237.3511/ (3 — 1) = 118.6756 
MSW = SSW/ (n— k) = 532.4889/ (25 — 3) = 24.2040 
(e) Fora=0.01 and df= (2, 22), the critical value of F = 5.72 as shown in Fig. E9.1. 
The value of the test statistic is 
F = MSB/MSW = 24.2040/1 18.6756 = 4.9031 
(f) 


ANOVA Table E9.1(a) 


| Source of variation | Degrees of freedom | Sum of squares | Mean square | Value of the test statistic 
Between IBY sll 118.6756 = 118.6756 


Within 532.4889 24.2040 marVICTY) cae 
Total 


Do not reject H, Reject H, 


0 Siz 
Critical value 


Fig. E9.1 


9.4 ONE-WAY ANOVA TEST 


Here, we test the null hypothesis that the means of all groups are all equal for all the population samples 
against the alternative hypothesis that the means of all the groups are not equal. In a one-way ANOVA 
test, the null hypothesis is that the means for all populations are equal. The alternative hypothesis states 
that at least one of the population means is different from the others. Table 9.1 gives a step-by-step method 
for conducting a one-way ANOVA test by using the critical value approach. Because the null hypothesis is 
rejected only when the test statistic, F, is too large, a one-way ANOVA test is always right tailed. 
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Table 9.1: One-way ANOVA test 


Purpose: To perform a hypothesis test to compare k population means, [), lo, ...., Uy, 
Assumptions: 
hi. Simple random samples 
Zs Independent samples 
3 Normal populations 
4. Equal population standard deviations 
Step 1: The null and alternative hypotheses are 
Ao: HW) = U2=.... =p, 
Af,: Not all the means are equal 
Step 2: Decide on the significance level, o 
Step 3: Obtain the three sums of squares, SSB, SSW and SST 
Step 4: Construct a one-way ANOVA table to find the value of the F-statistic 


Source of variation | df | Sum of squares | Mean squares=SS/df| _ F-statistic 


(Test statistic) 


ke Il 
alk 


SSB . 
SSW MSW a MSW 


Between 
Within 


Step 5: The critical value is F. with df = (k— 1,n—k) 
Step 6: If the value of the F-statistic falls in the region of rejection, reyect Hy: otherwise. do not reject fp. 


Do not reject Hp | Reject Hp 


Fig. 9.1 
Step 7: Interpret the results of the hypothesis test. 


Example E9.2 


Refer to Example E9.1. Can we reject the null hypothesis stated in Example E9.1 in part (a) at a significance 
level of 1%? 


SOLUTION: 


Refer to the solution given in Example E9.1. 
Since F = 4.9031 < 5.72, do not reject H,. The mean number of classes missed by all three age groups is the 


same. 
Example E9.3 


Table E9.3 shows the life of the light bulbs (in thousands) of four different brands investigated by a company 
in order to find out if the mean life of the four brands of light bulbs is the same. 
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Table E9.3 
Brand A | Brand B | Brand C | Brand D 


Test the null hypothesis at the 2.5% significance level that the mean life of light bulbs for each of 
these four brands is the same. 


SOLUTION: 
Ay: U, =H, =H, =H, 
H,: The means of the four populations are not equal 
k=4andn=n,+n,+n,+n,=7+74+7+7=28 
df for the numerator = k-1=4-1=3 
df for the denominator = n — k = 28 —-4 = 24 
For o& = 0.025 and df= (3, 24), the critical value of F is 3.72 [From the F-distribution table in Appendix-F]. 
T, = 162, T, = 145, T, = 172, T, = 180 
2x=7,+T,+T7,+T,= 162 + 142 + 172 + 180 =659 


Lx? = (22? + 23? + 00 + 27 + 20? + 22? + ++ + 20? + 23? + 26? + +++ + 29? + 25? + 24? +--+ 27) 
= 15,749 


= = S97 9357 
mh mh Mm MH 


ssu-| 2,22 Cok 
n 


T2 2 2 2 
SSW yx: [EEE 2 = 141.4286 
h hm hm NH 
MSB = SSB/ (k — 1) = 97.5357/ (4 — 1) = 32.5119 


MSW = SSW/ (n — k) = 141.4286/ (28 — 4) = 5.8929 
The value of the test statistic is 


_ MSB _ 32.5119 eee 
~ MSW 5.8929 


Reject H,. The mean life of light bulbs for each of these four brands A, B, C and D is not the same. 
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PROBLEMS 


P9.1 Table P9.1 shows the test scores of the students in three groups of fifteen students selected at 


random and who were experimented with three different methods of teaching algebra. Compute the 
value of the test statistic F. 


Table P9.1 


Method A | Method B 


Method C 
85 


47 60 
12 83 67 
ay 68 
64 70 
88 89 


P9.2 Refer to Problem P9.1. At the 1% significance level, can we reject the null hypothesis that the 
mean algebra score of all students taught by each of these three methods A, B and C is the same? 


P9.3 A study conducted gave the number of customers served by four bank tellers during each of the 
observed hours is summarised in Table P9.3. At the 5% significance level. test the null hypothesis 
that the mean number of customers served per hour by each of these four tellers is the same. Use 
the one-way ANOVA procedure. 


Table P9.3 


Teller A Teller D 
20 23 


19 


25 
28 
19 


P9.4 ‘Table P9.4 shows the data obtained for two samples selected at random from two populations that 
are independent and normally distributed with equal variances. 


Table P9.4 
Sample Al Sample B 
I5 10 
16 2) 

12 12 

Y 17 
2 15 
19 8 


17 9 


408 // Probability and Statistics for Scientists and Engineers // 
Using the one-way ANOVA procedure, test at the 5% significance level whether the means of the 
populations from which these samples are drawn are equal. 


P9.5 Table P9.5 gives the production of wheat (in bushels) for each acre for three brands of fertilizer. At 
the 5% significance level, can we conclude that the mean yield of wheat for each of these three 


brands of fertilizer is the same? 
Table P9.5 
Brand A | Brand B | Brand C 


P9.6 Table P9.6 lists the weight (in kg) gained by fifteen chickens randomly put in each of the three 
groups and fed by three brands of diet. At the 1% significance level, can we conclude that the 
mean weight gained by all chickens is the same for each of these three diets? 


Table P9.6 


P9.7 A consumer agency that wanted to compare the drying times for paints made of three brands 
tested a few samples of paints from each of these three brands. Table P9.7 records the drying 
times (in minutes) for these samples of paints. 


Table P9.7 
56 


62 
61 
ae) 
48 
61 


Using the 5% significance level, test the null hypothesis that the mean drying times for paints of 
these three brands are equal. 


P9.8 A consumer agency wants to check if the mean lives of four brands of light bulbs, which sell for 
nearly the same price, are the same. The agency randomly selected a few light bulbs of each brand 
and tested them. Table P9.8 gives the lives of these light bulbs in thousands of hours. 
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Table P9.8 

Brand D 
B 52 58 57 
67 54 12 52 
51 43 
57 44 
65 


At the 5% significance level, can we reject the null hypothesis that the mean life of each of these 
four brands of listed bulbs is the same? 


P9.9 Table P9.9 gives the independent random samples of households in the four US regions giving the 
last year’s energy consumptions. At the 5% significance level, does the data provide sufficient 
evidence to conclude that a difference exists in the last year’s mean energy consumption by 
households among the four US regions? 


Table P9.9 
Northeast | Midwest | South | West 
12 9 | 
8 11 
9 8 
13 qi 
9 


P9.10 Table P9.10 shows the data from independent simple random samples from several populations. 
Decide at the 5% significance level, whether the data provide sufficient evidence to conclude that 
the means of the populations from which the samples were drawn are not all the same. 


Table P9.10 
Sample 2 | Sample 3 | Sample 4 


REVIEW QUESTIONS He 


Describe the basic properties of F-curves in the F-distribution. 

State the assumptions (conditions) made in one-way ANOVA. 

Explain when a one-way ANOVA procedure is used to make a test of hypothesis. 
Explain the reason for the word variance in the phrase analysis of variance. 

In a one-way ANOVA, identify the statistic used 

(a) asameasure of variance among the sample means 


(b) asa measure of variance within the samples 
(c) to compare the variance among the sample means to the variation within the samples. 


nf WN 
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10. 


11. 


Describe the logic behind one-way ANOVA. 

What does the term one-way signify in the phrase one-way ANOVA? 

In a one-way ANOVA, what is the residual of an observation? 

Following are the notations for the three sums of squares. State the name of each sum of squares 
and the source of variation each sum of squares represents. 

(a) SSB (b) SST (c) SSW 

State the one-way ANOVA identity and interpret its meaning with regard to partitioning the total 
variation in the data. 

In each part, specify what type of analysis one might use? 

(a) To study the effect of one factor on the mean of a response variable. 

(b) To study the effect of two factors on the mean of a response variable. 


(c) If we define s =./y)4sw , of which parameter is s an estimate? 


True or false: If we know any two of the three sums of squares, SST, SSB and SSW, can we 
determine the remaining one? Explain your answer. 


STATE TRUE OR FALSE 


In one-way ANOVA, the populations from which the samples are drawn are approximately normally 


distributed. (True/False) 
The population from which the samples are drawn have the same variance (or standard deviation). 

(True/False) 
The samples drawn from different populations are random and dependent. (True/False) 
The variance between samples is also called the mean square between samples. (True/False) 
The variance within samples is also called the mean square within samples. (True/False) 
ANOVA is a procedure used to test the null hypothesis that the means of three or more populations 
are equal. (True/False) 


The value of the Test statistic F for an ANOVA test is calculated as F =~“ 80° between samples 


variance within samples 


(True/False) 
: SSB 
The MSB is calculated as MSB = ? (True/False) 
= 

The MSW is calculated as MSW = = (True/False) 
oe 7 

The analysis of variance technique that analyses one variable only. (True/False) 
Mean square between samples is a measure of the variation among means of samples taken from 
different populations. (True/False) 
Mean square within samples is a measure of the variation within data of all samples taken from 
different populations. (True/False) 
The F-distribution is continuous. (True/False) 


The F-distribution is always skewed to the left. (True/False) 
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15. The units of the F-distribution are always positive. 


(True/False) 

16. The one-way ANOVA test analyses only one population. (True/False) 
17. The one-way ANOVA test is always right-tailed. (True/False) 
18. For a one-way ANOVA with & treatments and 7 observations in all samples taken together, the 
number of degrees of freedom for the numerator are k — 1. (True/False) 


19. For a one-way ANOVA with & treatments and » observations in all samples taken together, the 
number of degrees of freedom for the denominator are n — k. (True/False) 


20. The ANOVA test can be applied to compare more than four population means only. (True/False) 


ANSWERS TO STATE TRUE OR FALSE 


1. True 2. True 3. False 4. True 5. True 6.True 7. True’ 8. False 9. True 10. True 
11. True 12. True 13. True 14. False 15. False 16. False 17. True 18. True 19. True 20. False 


ea) G: 
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Appendix—A 


Values for n Factorial 


120 
720 
5040 
40.320 
362.880 


3.628.800 
39,916,800 
479,001,600 
6,227,020,800 
87,178,291 ,200 


1.307.674,368.000 
20,922,789,888,000 
355,687,428 ,096,000 
6.402,373,705,728,000 
121.645, 100,408,832,000 


2,432,902.008, 176,640,000 
51,090,942,171,709,440,000 
1,124,000,727,777,607,680,000 
25,852,016,738,884,976,640,000 
620,448,401 ,733,239,439,360,000 


15,511,210,043,330,985,984,000,000 
403,291,461,126,605,635,584,000,000 
10,888,869,450,418,352,160,768,000,000 
304,888,344,611,713,860,501,504,000,000 
8,841,761 ,993,739,701,954,543,616,000,000 
265,252,859,812,191,058,636,308,480,000,000 
8.22284 x 10° 
2.63131 x 10 
8.68332 x 10° 
2.95233 x 10° 
1.03331 x 10” 


3.71993 x 10"! 
1.37638 x 10% 
5.23023 x 10% 
2.03979 x 10% 


8.15915 x 10” 
3.34525 x 10” 
1.40501 x 10°! 
6.04153 x 10” 
2.65827 x 10™ 
1.19622 x 10° 
5.50262 x 10°’ 
2.58623 x 10°° 
1.24139 x 10° 
6.08282 x 10° 
3.04141 x 10™ 
1.55112 x 10% 
8.06582 x 10° 
4.27488 x 10° 
2.30844 x 10”! 
1.26964 x 107 
7.10999 x 10” 
4.05269 x 10”° 
2.35056 x 10” 
1.38683 x 10° 
8.32099 x 10°! 
5.07580 x 10 
3.14700 x 10° 
1.98261 x 10°” 
1.26887 x 10°? 
8.24765 x 10” 
5.44345 x 10 
3.64711 x 10 
2.48004 x 10° 
ere 0 


1.19786 x 10! 
8.50479 x 10'°! 
6.12345 x 10!° 
4.47012 x 10!% 
3.30789 x 10!” 


2.48091 x 10'% 
1.88549 x 19!!! 
1.45183 x 10! 
1.13243 x 10! 
8.94618 x 10''° 


7.15695 x 10!" 
GP rsnalsis< ila" 
4.75364 x 10!” 
3.94552 x 10'4 
3.31424 x 10! 
2.81710 x 10! 
VAIIT WO. 
2.10776 x 10! 
1.85483 x 10!°** 
1.65080 x 10'"° 
[MASS 72ect dis 
1.35200 x 10° 
1.24384 x 10!” 
1.15677 x 10'" 
1.08737 x 10!*6 
1.03300 x 1018 
9.91678 x 10'” 
9.61928 x 10>! 
9.42689 x 10! 
9.33262 x 10!> 
9.33262 x 10!°” 
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Table of Binomial Probabilities 


tN 


f&WNre OO WN — © 


WN © 


n> 


0 
1 
2 
3 
4 
5 
6 


NSYHNARPWN kK © 


On NMN PWN © 
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Appendix—C (Continued) 


Table of Binomial Probabilities 


wma KnAUMN fF WNK OO cad 


5987 3487 71074 0282 .0060 0010 .0001 .0000 0000 .0000 .0000 
saillSyl 3874 .2684 lal 0403 .0098 .0016 .0001 .0000 .0000 .0000 
.0746 BLS 3020 2859 1209 0439 .0106 0014 .0001 .0000 .0000 
0105 0574 2013 .2668 2150 ire 0425 0090 .0008 .0000 .0000 
0010 0112 0881 .2001 2508 2051 lS .0368 0055 0001 6000 
0001 0015 .0264 1029 .2007 2461 .2007 1029 .0264 0015 .000 1 
0000 0001 0055 .0368 SEs: 2051 2508 .2001 .088 1 0112 .0010 
0000 0000 .0008 .0090 0425 lily: .2150 .2668 .2013 0574 0105 
.0000 0000 0001 0014 .0106 .0439 1209 2335 3020 1937 .0746 
0000 .0000 .0000 .0001 .0016 .0098 .0403 1211 2684 3874 eilksy)| 
.0000 .0000 .0000 .0000 0001 .0010 .0060 .0282 1074 3487 5987 


OV MINDNAAR WN CO 


— 


5688 3138 0859 .0198 0036 0005 .0000 .0000 0000 0000 .0000 
295 3835 .2362 .0932 .0266 0054 .0007 0000 0000 .0000 .0000 
.0867 PSM 295 .1998 .0887 .0269 0052 .0005 .0000 0000 0000 
.0137 0710 PPS) .2568 1774 .0806 0234 .0037 .0002 .0000 .0000 
0014 0158 1107 .2201 .2365 1611 .0701 .0173 0017 0000 .0000 
.0001 0025 .0388 mez 2207 .2256 1471 .0566 .0097 .0003 .0000 
.0000 .0003 .0097 .0566 1471 2256 .2207 Shey) .0388 .0025 0001 
.0000 .0000 .0017 .0173 .0701 1611 2365 2201 1107 0158 .0014 
.0000 .0000 .0002 0037 0234 .0806 .1774 2568 e225 0710 .0137 
.0000 .0000 0000 0005 0052 .0269 0887 1998 .2953 P\lsii| 0867 
.0000 .0000 .0000 .0000 .0007 0054 .0266 .0932 2362 3835 3293 
.0000 .0000 .0000 .0000 .0000 .0005 .0036 .0198 0859 3138 5688 


=SSooewmrasHWAkpWN-O 


— 


5404 2824 0687 0138 0022 0002 0000 O00 0000 0000 0000 
3413 3766 .2062 0712 0174 0029 0003 .O000 .0000 O000 0000 
0988 2301 .2835 .1678 .0639 0161 0025 0002 0000 0000 0000 
.0173 0852 2362 2397, 1419 0537 O15 OLS 0001 0000 0000 
002 1 0213 B29 est |e) 1208 0420 0078 0005 0000 0000 
0002 .0038 0532 1585 2270 1934 1009 1029) 0033 0000 0000 
.0000 0005 OI55 0792 .1766 2256 .1766 0792 O1S5 005 0000 
.0000 .0000 .0033 .0291 1009 1934 2270 1585 10582 0038 0002 
.0000 0000 0005 .0078 0420 1208 NES 2Bil)1 1389 One 0021 
0000 .O000 0001 0015S 0125 10587 1419 2807 2862 0852 OL73 
0000 .0000 0000 0002 0025 0161 0639 1678 22080 emul 0988 
.0000 .0000 0000 .0000 .0003 0029 .O174 Gia b2 3766 3413 
.0000 .0000 0000 .0000 .0000 0002 0022 0138 2824 S404 


OmAIAYNM AWN CO 
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Table of Binomial Probabilities 


a Se eae 


x 10 20 30 40 50 60 .70 80 90 95 
1853 0281 .0033 .0003 .0000 .0000 0000 0000 .0000 .0000 
3294 eS 0228 .0030 0002 0000 0000 0000 O00 0000 


2745 PAI .0732 0150 .0018 .0001 .0000 .0000 -0000 0000 
1423 .2463 1465 .0468 .0085 0008 0000 0000 0000 0000 
0514 .2001 .2040 1014 0278 .0040 0002 .0000 .0000 0000 
.0137 1201 .2099 1623 .0667 0142 .0013 0000 .0000 .0000 
0028 0550 1649 1983 m2 0392 .0056 0002 .0000 0000 
.0004 .0197 .LOLO 1889 .1746 .0840 0185 0012 .0000 .0000 
0001 0055 0487 1417 1964 1417 0487 0055 .0001 .0000 
.0000 0012 0185 .0840 .1746 1889 .1010 0197 .0004 .0000 
.0000 .0002 0056 .0392 Pe 1983 1649 0550 .0028 .0001 
0000 ~~ .0000 .0013 0142 .0666 1623 PAN) 1201 0137 0008 
.0000 .0000 .0002 .0040 .0278 1014 .2040 .2001 0514 0061 


0 
Il 
a 
3 
4 
5 
6 
a 
8 
9 


.0000 .0000 .0000 .0008 0085 .0468 .1465 2463 1423 .0359 
.0000 0000 0000 0001 O18 0150 0732 Sil 2745 1463 
.0000 .0000 0000 0000 0002 0030 Q228 LG 3294 3706 
0000 .0000 0000 0000 0000 0003 0033 0281 ASS3 AOI 


1668 A225 0023 0002 .0000 .0000 .0000 .0000 0000 .0000 
3150 0957 .0169 0019 0001 0000 -0000 0000 .0000 .0000 
.2800 1914 .0581 .0102 .0010 0001 .0000 .0000 .0000 .0000 
1556) 2595 1245 0341 0052 0004 .0000 .0000 0000 .0000 
.0605 .2093 1868 .0796 .0182 0021 0001 0000 .0000 0000 
.O175 1361 2081 ABZ) 0472, 0081 0006 0000 .0000 .0000 
.0039 .0680 1784 1839 0944 0242 0026 0001 0000 0000 
.0007 .0267 1201 eT 1484 0571 0095 0004 .0000 .0000 
0001 0084 0644 1606 1855 .1070 .0276 0021 0000 0000 
.0000 .0021 .0276 1070 1855 1606 0644 0084 0001 .0000 
.0000 0004 0095 0571 1484 Mela 1201 .0267 .0007 0000 
0000 0001 0026 0242 0944 1839 1784 0680 .0039 .0001 
.0000 .0000 0006 0081 0472 JST) 2081 1361 0175 .0010 
0000 0000 .0001 0021 0182 .0796 1868 .2093 .0605 .0076 
.0000 .0000 .0000 0004 0052 0341 1245 22393 1556 0415 
.0000 .0000 .0000 .0001 0010 .0102 0581 1914 .2800 “S75 
.0000 .0000 .0000 .0000 0001 .0019 0169 AUB) 3150 374) 

OOOO 0002 .0023 0225 1668 4181 


418 // Probability and Statistics for Scientists and Engineers // 


0 
il 
2 
3 
4 
5 
6 
7 
8 
9 


BS iT 
3763 
.1683 
0473 
0093 
0014 
.0002 
.0000 
.0000 
.0000 
.0000 
.0000 
.0000 
0000 
0000 
0000 
0000 
0000 
.0000 


3774 
3774 
1787 
0533 
0112 
0018 
.0002 
0000 
0000 
0000 
0000 
0000 
.0000 
.0000 
.0000 
0000 
0000 
.0000 
0000 
0000 
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20 


.0180 
0811 
23 
EN 
PAIS 
1507 
0816 
.0350 
.0120 
.0033 
.0008 
.0001 
0000 
.0000 
0000 
.0000 
0000 
0000 
.0000 


0144 
.0685 
1540 
.2182 
.2182 
1636 
0955 
0443 
0166 
0051 
0013 
0003 
0000 
.0000 
.0000 
.0000 
.0000 
.0000 
0000 
.0000 


P 

-30 40 50 60 
0016 0001 0000 .0000 
.0126 .0012 .0001 .0000 
0458 .0069 .0006 .0000 
1046 0246 0031 .0002 
1681 0614 LOU 0011 
2017 1146 0327 .0045 
1873 1655 .0708 0145 
1376 1892 1214 .0374 
0811 .1734 1669 .0771 
.0386 1284 1855 1284 
0149 0771 1669 .1734 
.0046 .0374 1214 1892 
0012 0145 .0708 .1655 
.0002 0045 0327 1146 
.0000 0011 0117 .0614 
.0000 .0002 0031 0246 
.0000 -0000 .0006 .0069 
.0000 .0000 .0001 0012 
0000 .0000 .0000 0001 
0011 0001 .0000 .0000 
.0093 .0008 .0000 .0000 
.0358 .0046 .0003 .0000 
.0869 0175 .0018 .0001 
1491 .0467 .0074 0005 
1916 .0933 0222 0024 
1916 1451 0518 0085 
HSS) a 0961 0237 
0981 1797 1442 0552 
0514 1464 1762 .0976 
0220 .0976 1762 1464 
0077 0532 1442 HSV 
.0022 0237 0961 STH 
0005 0085 0518 1451 . 
.0001 0024 0222 0933 
0000 .0005 0074 0467 
.0000 .0001 0018 0175 
.0000 .0000 .0003 .0046 
.0000 .0000 .0000 0008 
0000 .0000 0000 0001 
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Table of Binomial Probabilities 


P 

n x 05 10 20 30 40 50 60 -70 80 90 95 
20 0 3585 1216 0115 .0008 .0000 .0000 .0000 .0000 .0000 0000 .0000 
i 3774 .2702 .0576 0068 0005 0000 0000 .0000 .0000 .0000 -0000 

2 1887 2852 .1369 .0278 .0031 .0002 .0000 .0000 .0000 .0000 .0000 

3 0596 1901 .2054 .0716 0123 0011 0000 .0000 .0000 .0000 0000 

4 0133 0898 .2182 1304 0350 .0046 .0003 0000 .0000 0000 .0000 

5 0022 0319 1746 1789 0746 .0148 0013 .0000 .0000 0000 .0000 

6 .0003 .0089 1091 1916 1244 .0370 .0049 0002 .0000 .0000 0000 

7 .0000 .0020 0545 1643 1659 0739 .0146 0010 .0000 0000 .0000 

8 .0000 .0004 0222 1144 .1797 1201 0355 .0039 .0001 0000 .0000 

°) .0000 0001 0074 0654 1597 .1602 .0710 .0120 0005 .0000 .0000 

10 .0000 .0000 .0020 .0308 SUTTAL .1762 eh 187/1 0308 .0020 .0000 .0000 

1 .0000 .O000 0005 .0120 0710 1602 Alsi) 0654 0074 0001 0000 

12 .0000 .0000 0001 .0039 0355 1201 1797 1144 0222 .0004 .0000 

13 .0000 .0000 .0000 .0010 .0146 0739 1659 1643 0545 .0020 0000 

14 .0000 .0000 .0000 .0002 .0049 .0370 1244 1916 1091 .0089 0003 

15 .0000 0000 .0000 .0000 .0013 .0148 .0746 .1789 .1746 0319 0022 

16 .0000 .0000 0000 .0000 .0003 .0046 0350 1304 .2182 0898 0133 

17 .0000 .0000 .0000 .0000 .0000 O11 0123 0716 .2054 1901 .0596 

18 .0000 .0000 .0000 .0000 .0000 .0002 0031 0278 1369 2852 1887 

19 .0000 .0000 .0000 0000 .0000 .0000 .0005 .0068 0576 .2702 3774 

20 .0000 .0000 .0000 .0000 .0000 .0000 .0000 .0008 0115 1216 3585 

21 3406 1094. .0092 .0006 .0000 .0000 .0000 .0000 .0000 .0000 0000 


0 
1 3764 2559) 0484 .0050 .0003 .0000 .0000 .0000 .0000 .0000 .0000 
2 1981 2837 Lalli 0215 .0020 0001 .0000 .0000 .0000 .0000 .0000 
3 .0660 .1996 1917 0585 .0086 0006 ~ .0000 .0000 .0000 .0000 0000 
4 .0156 .0998 .2156 1128 0259 .0029 .0001 .0000 .0000 0000 0000 
5) .0028 .0377 S35 -1643 0588 .0097 .0007 .0000 0000 0000 .0000 
6 .0004 0112 222 1878 1045 0259 0027 .0001 .0000 .0000 .0000 
7 .0000 .0027 0655 elles) 1493 0554 0087 .0005 .0000 .0000 .0000 
8 0000 0005 0286 1294 1742 .0970 0229 0019 .0000 .0000 .0000 
) 0000 .0001 .0103 0801 .1677 1402 0497 .0063 .0002 .0000 .0000 
10 0000 .0000 .0031 0412 1342 .1682 0895 .0176 .0008 .0000 £0000 
11 .0000 .0000 .0008 .0176 .0895 .1682 1342 0412 .0031 .0000 0000 
4 0000 .0000 .0002 ~ .0063 0497 .1402 1677 0801 .0103 .0001 0000 
13 .0000 .0000 .0000 0019 0229 .0970 1742 1294 .0286 .0005 -0000 
14 0000 .0000 .0000 .0005 .0087 0554 1493 1725 .0655 0027 -0000 
15 0000 .0000 0000 0001 0027 0259 .1045 1878 21222 0112 0004 
16 .0000 .0000 0000 .0000 .0007 .0097 0588 .1643 .1833 0377 0028 
17 .0000 0000 .0000 .0000 .0001 0029 0259 1128 2156 .0998 .0156 
18 0000 .0000 .0000 .0000 0000 .0006 0086 0585 1917 1996 .0660 
Ig) 0000 .0000 -0000 .0000 .0000 0001 0020 0215 LZ 2837 1981 
20 .0000 .0000 .0000 0000 .0000 .0000 .0003 0050 0484 553 3764 
21 .0000 .0000 .0000 .0000 .0000 .0000 0000 .0006 0092 1094 3406 
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o 


OOOO 
000 
0000 
0000 
0000 
0000 


£0000 


Table of Binomial Probabilities 


0074 
.0406 
1065 
ale) 
.2108 
1898 
1344 
.0768 
.0360 
.0140 


0000 


0000 
0000 
0000 
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Table of Binomial Probabilities 


ZS 


OWN NMBWW— © 


p 
30 40 50 60 70 80 90 95 q 
0002 0000 0000 0000 0000 0000 0000 0000 
0020 O00 1 0000 0000 0000 0000 0000 OO00 
0097 .0006 0000 0000 0000 .0000 0000 ~=—_.0000 
0305 .0028 0001 0000 .0000 0000 0000 .0000 
0687 .0099 .0006 0000 .0000 .0000 0000  .0000 
1177. 0265 = 0025. Ss «0001. ~=Sss.0000)S («0000 ~S— «0000 ~—0000 
1598 0560 0080 .0004 0000 .0000 .0000  .0000 
1761 .0960 .0206 0017. 0000 .0000 .0000 0000 
1604 1360 ~=- 0438 ~——«0053,-s«0002-—s—«‘«i‘Ok—S—“‘«é«‘OH—S~C«00 
1222 1612 0779 0141 0008 0000 0000  .0000 
0785 1612 1169 0318 .0026 .0000 0000 .0@0 
0428 .1367  .1488 0608  .0079 .0002 0000 0000 
0199 0988 1612 0988 0199 0008  .0000 0000 
0079 —-.0608 1488  .1367 .0428  .0028  .0000 .0000 
0026 —-.0318 1169 1612 .0785 .0088  .0000 .0000 
0008 .0141 0779 1612 .1222 .0236  .0003 .0000 
0002 = .0053—Ss««0438 1360 .1604 0530 .0014 0000 
0000 .0017  .0206 .0960 .1761 .0998 0058 0001 
0000 = .0004._—~— «00800560 ~——.1598 1552 0202 ~—-.0008 
0000 =.0001. += 0025. «0265.—1177 1960 .0574 ~—-.0050 
0000  .0000 .0006 .0099  .0687 1960  .1292  .0238 
0000 .0000 .0001 .0028 0305 .1493 .2215 0862 
0000 .0000 .0000 .0006 0097. 0815 2718 .2232 
0000 .0000 .0000 0001 .0020 .0283 2127 3688 
0000 ~«©.0000 ~=. .0000 ~S(«0000-Ss—«c0002-—(‘«‘xT——ié«*79B——s«w2920 
0001  .0000 .0000 0000 .0000 0000 0000  .0000 
0014. ~=.0000 ~=—.0000 += 0000 ~—— «0000 ~— 0000) «0000 -~—=.0000 
0074  .0008 .0000 0000 §©©.0000 ~=—- .0000-~—— 0000 ~——.0000 
0243. +0019 ~=.0001 ~=©.0000 ~=s «.0000-~S'«.00k00S—«000——«=t#00000 
0572. ~=«.0071._~S(«.0004~Ss—«i«000-—“(«C000-—C(iés«=iwt0000-—‘«=w000——«=t0000 
1030 .0199 0016 .0000 .0000 .0000 0000 § .0000 
1472 0442 0053 .0002 .0000 .0000 0000 0000 
1712. +0800 ~=—s «.0143.—S's«c0009-S's«0000,-—i—<sé«<—COHKH—s—“—té=—=j000—S—S—S(=—0000 
1651 1200 .0322 .0031 .0001 0000 0000 .0000 
1336 ~=—-«.1511 «0609 ~—Ss«.0088~=——«.0004._-~— «0000 ~—— «0000 ~——.0000 
0916 1612 .0974 0212 .0013  .0000 0000 0000 
0536 1465 1328 .0434 0042 .0001 0000 0000 
0268 1140 1550 .0760 .0115 .0003 0000 0000 
0115 —-.0760 1550 .1140 .0268  .0012 0000 .0000 
0042 ©«.0434.—Ss««1328-—S—s«i«C46SSsi«iw0536~—s«0040S— «0000 ~——.0000 
0013. 0212. + .0974 = «1612, s—«0916—S——«0118 ~=— 0001 ~—.0000 
0004 ~=—s-.0088.-——(‘(<zO—=“‘‘ SS SC«336=— 0294 »=— 0004 ~— 0000 
0001  —-.0031 0322 1200 .1651 .0623 0018 .0000 
0000 ©«©.0009-—Ss(«w0143.—s—«w0800~——«1712 1108 .0072 0001 
0000 0002 0053 0442 1472 1633. .0239 0010 
0000 0000 0016 «=: .0199_~—S—=—=#1030 1960  .0646  .0060 
0000 .0000 0004 0071 ~=.0572 1867  .1384 0269 
0000 .0000 0001 0019 .0243 1358  .2265  .0930 
‘0000 += «0000S —(«0000S'«w0004Ss«w0074—Ss«0708 =~ «2659 ~—.2305 
0000 ~=.0000 = «0000S 's(«w000—s—‘(‘wSsiw286H «1994S «3650 
0000 +0000 ~=— «0000 -=S(«0000-'—«w— (ss«Cwc0038-—s«718 = 274 
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Cumulative Poisson Distributions 


nr 

v 0.1 0.2 0.3 0.4 0.5 0.6 0.7 

0 9048 8187 .7408 .6703 6065 5488 4966 
1 0905 .1637 Pes .2681 .3033 3293 3476 
2 0045 .0164 033 .0536 0758 0988 1217 
3 .0002 OO11 .0033 0072 0126 0198 0284 
+ .0000 .0001 .0003 .0007 .0016 .0030 0050 
5 .0000 .0000 .0000 .0001 .0002 .0004 .0007 
6 .0000 .0000 0000 .0000 0000 .0000 0001 
7 0000 0000 .0000 .0000 .0000 .0000 .0000 


1.2 eS 1.4 1.7 1.8 1.9 


2.2 2.3 2.4 2.5 2.6 2.7 2.8 2.9 
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Cumulative Poisson Distributions 


- 
+ 3.2 35 3.4 3.6 a7 3.8 a) 4.0 
0 0273 .0247 0224 0202 0183 
l 0984 0915 0850 .0789 0733 
2 TA Miofe)2 1615 B53 1465 
3 PAWS) .2087 .2046 .2001 1954 
4 ) ollSi3i 1944 1951 1954 
5) les? 1429 1477 21522 toes 
6 .0826 0881 0936 0989 1042 
7 0425 .0466 .0508 0551 0595 
8 pC 0215 0241 .0269 .0298 
9 .0076 0089 .0102 .0116 .0132 

10 0028 .0033 0039 0045 .0053 
11 .0009 0011 .0013 .0016 .0019 
12 .0003 .0003 .0004 0005 0006 
13 0001 0001 0001 .0002 0002 
14 .0000 0000 .0000 0000 .0001 
x 4.6 4.7 4.8 4.9 5.0 
0 0166 0150 .0136 0123 OLLL 0101 0091 0082 0074 0067 
! .0679 .0630 0583 0540 .O500 0462 0427 0395 0365 0337 
2 BOS oll 323) 1254 1188 all 125) 1063 1005 0948 0894 0842 
3 1904 allio .1798 .1743 .1687 1631 1574 piles! 7/ 1460 1404 
4 LOS 1944 all @33) SOY 1898 1875 1849 e2o 1789 all 7/25 
5 .1600 lO) .1662 1687 1708 al25 1738 1747 as aS 
6 .1039 1143 moi BOY 1281 B25 . WR@2 B88 1432 1462 
7 .0640 .0686 0732 .0778 .0824 0869 0914 0959 1002 1044 
8 0328 .0360 0393 0428 .0463 0500 0537 0575 0614 0653 
o 0150 .0168 0188 0209 0232 ONS: 0281 0307 0334 .0363 
10 .006 1 .0071 .008 | .0092 .0104 0118 -O182 O47 0164 0181 
1a .0023 .0027 .0032 .0037 .0043 0049 0056 .0064 .0073 0082 
2 0008 0009 O11 0014 .0016 .0019 0022 .0026 0030 0034 
13 0002 .0003 .0004 .0005 0006 .0007 0008 0009 O01! .0013 
14 0001 .0001 .0001 0001 0002 0002 003 0003 0004 005 
15 .0000 .0000 .0000 .0000 0001 0001 | 0001 0001 0001 .0002 
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Cumulative Poisson Distributions 


5.6 Sl) 5.8 


x 6.1 6.2 6.3 6.4 6.5 6.6 6.7 6.8 6.9 7.0 
0 .0022 .0020 .0018 .0009 

l 0137 .0126 .0116 .0064 
2 0417 .0390 .0364 0223 
3 0848 .0806 0765 0521 
4 .1294 1249 21205 0912 
5 lo 1549 Ae, medal 
6 1605 1601 1595 .1490 
} 1399 1418 1435 .1490 
8 .1066 .1099 1130 .1304 
y 0723 0757 0791 1014 
10 0441 .0469 0498 .0710 
11 0244 .0265 0285 0452 
12 0124 .0137 0150 .0263 
13 0058 .0065 .0073 0142 
14 .0025 0029 .0033 .007 1 
Is 0010 .0012 0014 .0033 
16 .0004 0005 0005 0014 
Uy .0001 .0002 .0002 .0006 
18 .0000 0001 0001 0002 
19 .0000 .0000 0000 


0001 
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Cumulative Poisson Distributions 


8.1 
.0003 
0025 
.0100 
.0269 
0544 


0882 
1191 
1378 
nligo> 
eS 


Oe 
0749 
0505 
0315 
0182 


.0098 
0050 
0024 
0011 
0005 


0002 
0001 
0000 
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Cumulative Poisson Distributions 
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Cumulative Poisson Distributions 


A 

1 is 14 15 16 17 

0000 0000 0000 0000 0000 0000 

0001 0000 0000 0000 0000 0000 

0004 0002 0001 0000 0000 0000 

0018 0008 0004 0002 0001 0000 

0053 0027 0013 0006 0003 0001 

0127 .0070 0037 0019 .0010 .0005 .0002 0001 -000 1 
0255 0152 0087 0048 .0026 0014 .0007 0004 .0002 
0437 0281 0174 0104 .0060 0034 0019 .0010 0005 
0655 0457 .0304 0194 .0120 0072 0042 .0024 0013 
0874 .0661 0473 0324 0213 0135 .0083 .0050 .0029 
1048 0859 0663 .0486 0341 0230 0150 .0095 0058 
1144 1015 0844 .0663 .0496 0355 0245 .0164 .0106 
1144 .1099 0984 .0829 0661 0504 0368 0259 .0176 
.1056 .1099 .1060 0956 0814 0658 0509 .0378 0271 
.0905 1021 .1060 1024 .0930 0800 0655 0514 0387 
0724 0885 0989 1024 0992 .0906 .0786 0650 0516 
0543 0719 .0866 .0960 0992 0963 0884 0772 .0646 
.0383 0550 0713 0847 .0934 .0963 .0936 0863 .0760 
0255 0397 0554 .0706 .0830 .0909 0936 0911 0844 
0161 10272 0409 0557 .0699 0814 0887 0911 0888 
.0097 0177 .0286 0418 0559 .0692 0798 .0866 0888 
0055 .0109 0191 .0299 0426 0560 0684 .0783 .0846 
0030 .0065 0121 .0204 0310 0433 0560 .0676 .0769 
.0016 0037 .0074 0133 0216 0320 0438 0559 .0669 
.0008 .0020 0043 .0083 0144 0226 0328 0442 0557 
.0004 .0010 0024 .0050 0092 0154 0237 .0336 .0446 
.0002 0005 0013 0029 0057 0101 0164 .0246 0343 
0001 0002 .0007 .0016 0034 0063 0109 0173 0254 
0000 0001 .0003 0009 0019 0038 .0070 0117 0181 
.0000 0001 .0002 .0004 O01! 0023 0044 .0077 0125 
.0000 .0000 .0001 .0002 .0006 0013 .0026 .0049 .0083 
.0000 .0000 .0000 0001 0003 .0007 0015 0030 .0054 
.0000 -0000 .0000 0001 0001 0004 .0009 0018 0034 
0000 .0000 .0000 0000 0001 0002 .0005 0010 .0020 
0000 .0000 0000 0000 .0000 0001 .0002 .0006 0012 
.0000 .0000 .0000 .0000 .0000 0000 0001 .0003 .0007 
.0000 .0000 .0000 0000 .0000 0000 0001 .0002 .0004 
.0000 0000 0000 .0000 .0000 0000 .0000 0001 .0002 
.0000 .0000 .0000 .0000 .0000 0000 0000 0000 .000 1 


.0000 .0000 .0000 0000 .0000 0000 0000 0000 .0001 
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Cumulative Standard Normal Distribution 


= 
1 — 


@(z)=P(Z<z)= e* du 


vm 


-0.09 —0.08 —0.05 —0.04 0.03 0.02 


0.01 


0.000042 
0.000064 
0.000096 
0.000142 
0.000208 
0.000302 
0.000434 
0.000619 
0.000874 
0.001223 
0.001695 
0.002327 
0.003167 
0.004269 
0.005703 
0.007549 
0.009903 
0.012874 
0.016586 
0.021178 
0.026803 
0.033625 
0.041815 
0.050551 
0.063008 
0.076359 
0.091759 
0.109349 
0.129238 
0.151505 
0.176185 
0.203269 
0.232695 
0.264347 
0.298056 
0.333598 
0.370700 
0.409046 
0.448283 
0.488033 


0.000044 
0.000067 
0.000100 
0.000147 
0.000216 
0.0003 13 
0.000450 
0.000641 
0.000904 
0.001264 
0.001750 
0.002401 
0.003264 
0.004396 
0.005868 
0.007760 
0,010170 
0.013209 
0.017003 
0.021692 
0.027429 
0.034379 
0,042716 
0.052616 
0.064256 
0.077804 
0.093418 
0.111233 
0.131357 
0.153864 
0.178786 
0.206108 
0.235762 


0.000033 |0.000034 
0.000050 |0.000052 
0.000075 |0.000078 
0.000112 |0.000117 
0.000165  |0.000172 
0.000242 |0.000251 
0.000350 }0.000362 
0.000501 }0.000519 
0.000711 }0.000736 
0.001001 |0.001035 
0.001395 |0.001441 
0.001926 |0.001988 
0.002635 |0.002718 
0.003573 |0.003681 
0.004799 |0,004940 
0.006387 {0.006569 
0.008424 |0.008656 
0.011011 |0,011304 
0.014262 }0.014629 
0.018309 {0.018763 
0.023295 [0.023852 


0.000036 — |0.000037 — |0.000039 — |0.000041 
0.000054 |0.000057 |0.000059 |0.000062 
0.000082 {0.000085 |0.000088 |0.000092 
0.000121 40.000126 |0.000131 |0.000136 
0.000179 |0.000185 |0.000193 {0.000200 
0.000260 |0.000270 |0.000280 |0.000291 
0.000376 |0.000390 }0.000404 |0.000419 
0.000538 |0.000557 |0.000577 |0.000598 
0.000762 {0.000789 |0.000816 |0.000845 
0.001070 |0.001107 }0.001144 |0.001183 
0.001489 |0.001538 |0.001589 0.001641 
0.002052 |0.002118 |0.002186 |0.002256 
0.002803 {0.002890 |0.002980 |0.003072 
0.003793 0.003907 0.004025 {0.004145 
0.005085 {0.005234 |0.005386 0.005543 
0.006756 {0.006947 |0.007143 |0.007344 
0.008894 |0.009137 |0,009387 0.009642 
0.011604 {0.011911 |0.012224 |0.012545 
0.015003 |0.015386 |0,015778 |0.016177 
0.019226 0.019699 {0.020182 {0.020675 
0.024419 |0.024998 |0.025588 |0.026190 
0.029379 }0.030054 |0.030742 |0.031443 |0.032157 0.032884 
0.036727 |0.037538 }0.038364 |0.039204 |0.040059 |0.040929 
0.045514 |0.046479 }0.047460 |0.048457 {0.049471 |0.050503 
0.055917 |0.057053 |0.058208 |0.059380 |0.060571 0.061780 
0.068112 |0.069437 |0.070781 |0.072146 |0.073529 |0.074934 
0.082264 |0,083793 0.085343 {0.086915 |0.088508 |0.090123 
0.098525 |0.100273 0.102042 |0.103835 0.105650 |0.107488 
0.117023 j0.119000 0.121001 |0.123024 |0.125072 |0.127143 
0.137857 }0.140071 {0.142310 0.144572 {0.146859 0.149170 
0.161087 |0.163543 |0.166023 0.168528 |0.171056 |0.173609 
0.186733 |0.189430 |0.192150 0.194894 |0.197662 |0.200454 
0.214764 |0.217695 |0.220650 |0.223627 |0.226627 {0.229650 
0.245097 |0.248252 |0.251429 |0.254627 |0.257846 |0.261086 
0.277595 }0.280957  |0.284339 |0.287740 |0.291160 |0.294599 
0.312067 |0.315614 |0.319178 |0.322758 |0.326355 |0.329969 
0.348268 |0.351973 |0.355691 |0.359424 |0.363169 |0.366928 
0.385908 |0.389739 |0.393580 |0.397432 |0.401294 |0.405165 
0.424655 |0.428576 |0.432505 |0.436441 |0.440382 {0.444330 
0.464144 |0.468119 |0.472097 |0.476078 |0.480061 |0.484047 


0.000046 
0.000069 
0.000104 
0.000153 
0.000224 
0.000325 
0.000467 
0.000664 
0.000935 
0.001306 
0.001807 
0.002477 
0.003364 
0.004527 
0.006037 
0.007976 
0.010444 
0.013553 
0.017429 
0.022216 
0.028067 
0.035148 
0.043633 
0.053699 
0.065522 
0.079270 
0.095098 
0.113140 
0.133500 
0.156248 
0.181411 
0.208970 
0.238852 
0.270931 
0.305026 
0.340903 
0.378281 
0.416834 
0.456205 
0.496011 


0.000048 
0.000072 
0.000108 
0.000159 
0.000233 
0.000337 
0.000483 
0.000687 
0.000968 
0.001350 
0.001866 
0.002555 
0.003467 
0.004661 
0.006210 
0.008198 
0.010724 
0.013903 
0.017864 
0.022750 
0.028717 
0.035930 
0.044565 
0054799 
0.066807 
0.080757 
0.096801 
0.115070 
0.135666 
0.158655 
0.184060 
0.211855 
0.241964 
0.274253 
0.308538 
0.344578 
0.382089 
0.420740 
0.460172 
0.500000 


—3.2 


-2.9 
2.8 
-2.7 
-2.6 
-2.5 
-2.4 
=2.3 


2.2 
—2.1 
-2.0 
-1.9 
-1.8 
-1.7 
-1.6 
—1.5 
-1.4 
-1.3 
-1.2 
-1 
-1.0 
-0.9 


Appendix—E (Continued) 


Cumulative Standard Norma! Distribution 


—un 


0.999954 


w(o=rizs-| e 2 du 
1 
i] 
®(Z) 
1 
i} 
0) Zs 
0.00 0.01 0.02 0.03 0.06 0.07, | 0.08 0.09 
0.500000 |0.503989 |0.507978 [0.511967 0.523922 [0.527903 [0.531881 [0.535856 
0.539828 |0.543795 |0.547758 |0.551717 0.563559 |0.567495 |0.571424 |0.575345 
0.579260 |0.583166 |0.587064 {0.590954 0.602568 |0.606420 |0.610261 |0.614092 
0.617911 {0.621719 |0.625516 |0.629300 0.640576 |0.644309 |0.648027 | 0.651732 
0.655422 |0.659097 |0.662757 | 0.666402 0.677242 |0.680822 |0.684386 |0.687933 
0.691462 |0.694974 |0.698468 | 0.701944 0.712260 |0.715662 |0.719043 |0.722405 
0.725747 |0.729069 |0.732371 |0.735653 0.745373 |0.748571 |0.751748 | 0.754903 
0.758036 |0.761148 |0.764238 |0.767305 0.776373 |0.779350 |0.782305 |0.785236 
0.788145 |0.791030 |0.793892 |0.796731 0.805106 |0.807850 |0.810570 |0.813267 
0.815940 |0.818589 |0.821214 |0.823815 0.831472 |0.833977 |0.836457 |0.838913 
0.841345 |0.843752 |0.846136 |0.848495 0.855428 |0.857690 |0.859929 |0.862143 
0.864334 |0.866500 {0.868643 |0.870762 0.876976 |0.878999 |0.881000 [0.882977 
0.884930 |0.886860 |0.888767 | 0.890651 0.896165 |0.897958 |0.899727 |0.901475 
0.903199 |0.904902 |0.906582 |0.908241 0.913085 |0.914657 |0.916207 | 0.917736 
0.919243 |0.920730 {0.922196 |0.923641 0.927855 |0.929219 |0.930563 |0.931888 
0.933193 |0.934478 {0.935744 |0.936992 0.940620 |0.941792 |0.942947 | 0.944083 
0.945201 |0.946301 |0.947384 |0.948449 0.951543 |0.952540 |0.953521 |0.954486 
0.955435 |0.956367 |0.957284 |0.958185 0.960796 |0.961636 |0.962462 | 0.963273 
0.964070 |0.964852 |0.965621 | 0.966375 0.968557 |0.969258 |0.969946 | 0.970621 
0.971283 |0.971933 |0.972571 |0.973197 0.975002 |0.975581 |0.976148 |0.976705 
0.977250 |0.977784 }|0.978308 | 0.978822 0.980301 |0.980774 |0.981237 /0.981691 
0.982136 |0.982571 |0.982997 |0.983414 0.984614 |0.984997 |0.985371 | 0.985738 
0.986097 |0.986447 |0.986791 |0.987126 0.988089 |0.988396 | 0.988696 | 0.988989 
0.989276 |0.989556 |0.989830 |0.990097 0.990863 {0.991106 [0.991344 |0.991576 
0.991802 |0.992024 |0.992240 |0.992451 0.993053 |0.993244 |0.993431 |0.993613 
0.993790 |0.993963 |0.994132 |0.994297 0.994766 |0.994915 |0.995060. |0.995201 
0.995339 |0.995473 |0.995604 |0.995731 0.996093 |0.996207 |0.996319 | 0.996427 
0.996533 |0.996636 {0.996736 | 0.996833 0.997110 |0.997197 |0.997282 | 0.997365 
0.997445 |0.997523 |0.997599 |0.997673 0.997882 |0.997948 |0.998012 |0.998074 
0.998134 |0.998193 |0.998250 | 0.998305 0.998462 |0.998511 |0.998559 |0.998605 
0.998650 |0.998694 |0.998736 |0.998777 0.998893 |0.998930 |0.998965  |0.998999 
0.999032 |0.999065 |0.999096 | 0.999126 0.999211 0.999238 |0.999264 |0.999289 
0.999313 [0.999336 |0.999359 |0.999381 0.999443 |0.999462 |0.999481 0.999499 
0.999517 |0.999533 |0.999550 | 0.999566 0.999610 |0.999624 |0.999638 | 0.999650 
0.999663 |0.999675 | 0.999687 | 0.999698 0.999730 |0.999740 |0.999749 | 0.999758 
0.999767 |0.999776 |0.999784 |0.999792 0.999815 |0.999821 | 0.999828 |0.999835 
0.999841 |0.999847 [0.999853 |0.999858 0.999874 |0.999879 |0.999883 | 0.999888 
(0.999892 |0.999896 |0.999900 |0.999904 0.999915 [0.999918 |0.999922 
0.999931 | 0.999933 |0.999936 0.999943 |0.999946 | 0.999948 
0.999956 |0.999958 0.999963 |0.999964 |0.999966 | 0.999967 
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chi-square Distribution Table 


Percentage Points y2, of the chi-square Distribution 


2.706 3.84] 
4.605 5.991 
6.251 7.815 
7.779 9.488 
9.236 11.070 


10.645 hey 
12.017 14.067 
13,362 15.507 
14.684 16.919 
15.987 18.307 


WITS 19.675 
18.549 21.026 
19.812 22.362 
21.064 23.685 
22.307 24.996 


23.542 26.296 
24.769 27.587 
25.989 28.869 
27.204 20.144 
28.412 31.410 


29.615 32.671 4 : : 

30.813 33.924 : ! 42.796 
32.007 SNe : 44.181 
33.196 36.415 : 45.559 
34.382 37.652 : 46.928 


35.563 38.885 : E 48.290 — 
36.741 40.113 ; 46.963 49.645 
37.916 41.337 : 48.278 50.993 
39.087 42.557 é 49.588 52.336 
40.256 43.773 ' 50.892 53.672 


51.805 55.758 : 63.691 66.766 
63.167 67.505 71.420 76.154 79.490 
74.397 79,082 83.298 88.379 ON952 
85.527 90.531 95.023 100.425 104.215 
96.578 101.879 106.629 | 112.329 116,321 


LOTS OSM sel45 118.136 124.116 128.299 
118.498 124.342 129.561 135.807 140.169 
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Appendix—G 


The ¢-Distribution Table 


Percentage Points t,, of the t-Distribution 


i=) 

Coa 
& 
<< 


SS 40 25 10 05 025 01 005 0025 001 0005 
1 | 325 1000 3.078 6314 12.706 31.821 63.657 127.32 318.31 636.62 
2 | .289 816 1.886 2.920 4.303 6.965 9.925 14.089 23.326 31.598 
Sa 277 765 1.638 ~—-2.353 3.182 4.541 5.841 7.453 10.213 12.924 
4 | 271 741 1.533" 2.132 2.716 3.747 4.604 5.598 7.173 8.610 
an 267 727 1476 ~—-2.015 257i 3.365 4.032 4.773 5.893 6.869 
6 | .265 718 1440 = 1.943 2.447 3.143 3.707 4.317 5.208 5.959 
7 | .263 TIL «1415 1.895 2.365 2.998 3.499 4.029 4.785 5.408 
8 | .262 106 1.397 ‘1.860 2.306 2.896 3.355 3.833 4.501 5.041 
9 | .261 703. 1.383 1.833 2.262 2.821 3.250 3.690 4.297 4.781 
10 | .260 700 1.372 ‘1.812 2.228 2.764 3.169 3.581 4.144 4.587 
11 | .260 697 1363 ‘1.796 2.201 2.718 3.106 3.497 4.025 4.437 
12 | .259 695 1.356 ‘1.782 2.179 2.681 3.055 3.428 3.930 4.318 
13 | .259 694 1350 1.771 2.160 2.650 3.012 3.372 3.852 4.221 
14 | .258 692 1345 1.761 2.145 2.624 2.977 3.326 3.787 4.140 
157) 258 691 1341 1.753 2.131 2.602 2.947 3.286 3.733 4.073 
16 | .258 690 1.337 1.746 2.120 2.583 2.921 3.252 3.686 4.015 
BE 257. 689 1.333 1.740 2.110 2.567 2.898 3,222 3.646 3.965 
18 1.257 688 1.330 1.734 2.101 2.552 2.878 3.197 3.610 3.922 
19 | .257 688 1.328 ‘1.729 2.093 2.539 2.861 3.174 3.579 3.883 
20) | 257 C87 mm 25) 1-725 2.086 2.528 2.845 3.153 3.552 3.850 
Bi) 257 686 1323 1.721 2.080 2.518 2.831 3.135 3.527 3.819 
221256 686 1321 1717 2.074 2.508 2.819 3.119 3.505 3.792 
23 | .256 685 1319 1714 2.069 2.500 2.807 3.104 3.485 3.767 
24 | .256 68 1318 L711 2.064 2.492 2.797 3,091 3.467 3.745 
25 | .256 684 1316 1.708 2.060 2.485 2.787 3.078 3.450 3.725 
26 | .256 684 1315 — 1.706 2.056 2.479 2.779 3.067 3.435 3.707 
27 | .256 684 1314 1.703 2.052 2.473 2.771 3.057 3.421 3.690 
28 | .256 683 1313 1.701 2.048 2.467 2.763 3.047 3.408 3.674 
29 | .256 683 1311 1.699 2.045 2.462 2.756 3.038 3.396 3.659 
30 | .256 683 1310 1.697 2.042 2.457 2.750 3.030 3.385 3.646 
40 | .255 681 1303 1.684 2.021 2.423 2.704 2.971 3.307 3.551 
60 | .254 679 1.296 ~—-1.671 2.000 2.390 2.660 2.915 3.232 3.460 

120 | .254 677 1.289: 1.658 1,980 2.358 2.617 2.860 3.160 3.373 | 
co | .253 674 __:1.282_ —_—:1.645 1.960 2.326 2.576 2.807 3.090 3.291 


v = degrees of freedom. 
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The #-Distribution Table 


Area in the Right Tail under the ¢-distribution Curve 
a : 0.05 0.025 0.01 0.005 
1 3,078 6.314 12.706 31.821 63.657 318.309 
2 1.886 2.920 4.303 6.965 9.925 22.327 
3 1.638 255 3.182 4.541 5.841 10.215 
4 1153335} 2.132 2.776 3.747 4.604 Tolls 
5 1.476 2.015 PS yi 3.365 4.032 5.893 
6 1.440 1.943 2.447 3.143 3.707 5.208 
7 1.415 1.895 2.365 2.998 3.499 4.785 
8 1.397 1.860 2.306 2.896 3,355 4.501 
9 1.383 1.833 2.262 2.821 3.250 4.297 
10 Len 1.812 2.228 2.764 3.169 4.144 
11 1.363 1.796 2.201 2.718 3.106 4.025 
Ile 1.356 1.782 2.179 2.681 3.055 3.930 
13 Less 0) 1.771 2.160 2.650 3.012 3.852 
14 1.345 1.761 2.145 2.624 ZO 3.787 
15 1.341 7a 2.1311 2.602 2.947 3.733 
16 13377 1.746 2.120 2.583 2.921 3.686 
17 1.333 1.740 2.110 2.567 2.898 3.646 
18 1.330 1.734 2.101 2552 2.878 3.610 
19 1.328 1.729 2.093 2.539 2.861 3.579 
20 1.325 1.725 2.086 2.528 2.845 S592 
21 1.323 72 2.080 2.518 2.831 B27 
2) 1,321 og 2.074 2.508 2.819 3.505 
23 1,319 1.714 2.069 2.500 2.807 3.485 
24 1.318 AI 2.064 2.492 2H 3.467 
25 1.316 1.708 2.060 2.485 2.787 3.450 
26 1.315 1.706 2.056 2.479 2.779 3.435 
2) 1.314 1.703 2.052 2.473 erilel 3.421 
28 sil 1.701 2,048 2.467 2.763 3.408 
29 Tesia 1.699 2.045 2.462 ATS) 3.396 
30 1.310 1.697 2.042 2.457 2.750 3.385 
31 1.309 1.696 2.040 2.453 2.744 Boh 
32 1309 1.694 2.037 2.449 2.738 3365 
33 1.308 1.692 2.035 2.445 2S 3.356 
34 1.307 1.691 2.032 2.441 2.728 3.348 
35 1.306 1.690 2.030 2.438 2.724 3.340 
36 1.306 1.688 2.028 2.434 2.719 3.333 
ai 1.305 1.687 2.026 2.431 PL TANS) 3.326 
38 1.304 1.686 2.024 2.429 Pa TIM 72 3.319 


2.023 


2.021 
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The #-Distribution Table 


3 


infinite 


1.645 


1.960 


2.326 


Area in the Right Tail under the t-distribution Curve 

4 0.05 0.025 0.01 0.005 0.001 
41 — 1.303 1.683 2.020 2.421 2.701 3.301 
42 1.302 1.682 2.018 2.418 2.698 3.296 
43 1.302 1.681 2.017 2.416 2.695 3.291 
44 1.301 1.680 2.015 2.414 2.692 3.286 
45 1.301 1.679 2.014 2.412 2.690 3.281 
46 1.300 1.679 2.013 2.410 2.687 BONH] 
47 1.300 1.678 2.012 2.408 2.685 3.273 
48 1,299 1.677 2.011 2.407 2.682 3.269 
49 e299 1.677 2,010 2.405 2.680 3.265 
50 1.299 1.676 2.009 2.403 2.678 3.261 
ail 1.298 1.675 2.008 2.402 2.676 3.258 
By 1.298 HOTS 2.007 2.400 2.674 3.255 
313) 1.298 1.674 2.006 2.399 2.672 S25 
54 1.297 1.674 2.005 2.397 2.670 3.248 
55 1.297 1.673 2.004 2.396 2.668 3.245 
56 1.297 1.673 2.003 2.395 2.667 3.242 
57 1,297 1,672 2.002 2.394 2.665 3.239 
58 1.296 1.672 2.002 2.392 2.663 SEY 
59 1.296 1.671 2.001 23M 2.662 3.234 
60 1.296 1.671 2.000 2.390 2.660 31232 
61 1,296 1.670 2.000 2.389 2.659 3.229 
62 1295 1.670 1.999 2.388 2.657 S227 
63 e295) 1.669 1.998 2.387 2.656 3.225 
64 1.295 1.669 1,998 2.386 2.655 322038 
65 E295 1.669 1.997 2.385 2.654 3.220 
66 1.295 1.668 1997 2.384 2.652 3.218 
67 1.294 1,668 1.996 2.383 2.651 3.216 
68 1,294 1.668 1995 2.382 2.650 3.214 
69 1.294 1.667 1.995 2.382 2.649 3.213 
70 1.294 1.667 1.994 2.381 2.648 3.211 
71 1.294 1.667 1.994 2.380 2.647 3.209 
WZ 1,293 1.666 1.993 20) 2.646 Be Or 
73 1.293 1.666 1.993 2.319 2.645 3.206 
74 1.293 1.666 O93 2.378 2.644 3.204 
15) 1.293 1.665 1.992 Peep 2.643 302 


2.576 


3.090 
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The F-Distribution Table 


Percentage Points f, .,, of the 


F-Distribution 


fo.2s, u,v 


Degrees of freedom for the numerator(u) 

1 2 3 4 5 6 7 8 9 10 12 15 20 24 
5.83 7.50 8.20 8.58 8.82 8.98 9.10 9.19 9.26 9.32 9.41 949 9.58 9.63 
257300) Susy 323 328) S53 34033587 S38 sos 4 S43 4s 
2.02 2.28 2.36 2.39 2.41 242 2.43 244 2.44 244 2.45 246 2.46 2.46 
1.81 2.00 2.05 2.06 2.07 2.08 2.08 2.08 2.08 2.08 2.08 2.08 2.08 2.08 


PWN 


5 | 1.69 1.85 1.88 1.89 1.89 1.89 1.89 1.89 1.89 1.89 1.89 1.89 1.88 1.88 

© |] Wee Maw iekS eA) eS) leas Wer) ily Mer ae oie Toate, Werke IS 

A || WS Terk) ee ee SAL eT TA WhO) e7kO) Weis) Mees) Tere} Mey Laie 

8 | 1.54 1.66 1.67 1.66 1.66 1.65 1.64 1.64 1.63 1.63 1.62 1.62 1.61 1.60 

|) WS Gee Mes} ess Nee oil ye@) te0) ILS) si) ist) ils! este) Il Sis 

S KON RAO EGO EGOMsIES OMe 9 eeles Seles leone lesOml: 5 Smleo+ ellos eels? males” 
= il |f ety SKS Glses Si ilsfe si) Tse! toe} ike) Iles) teil Myst) lah) 11.48: 
<I Wp They Slesieyaleste) alesis) fies! Vlesys) Slesy2 ilesyst Iles Test) shea Lei Ts) ILeks) 
a 13 | 145 1.55 1.55 1.53 1.52 1.51 1.50 1.49 1.49 148 147 146 145 1.44 
6 14 | 1.44 1.53 1.53 1.52 1.51 1.50 1.49 1.48 147 146 145 1.44 1.43 1.42 
s 15 | 143 1.52 1.52 1.51 149 148 1.47 146 146 1.45 144 1.43 141 141 
2 16 | 142 1.51 1.51 1.50 148 147 146 145 144 1.44 143 141 140 1.39 
: 17 | 142 1.51 1.50 149 1.47 146 1.45 144 143 143 141 1.40 1.39 1.38 
Ss 18 | 141 150 149 148 146 145 144 143 142 142 1.40 1.39 1.38 1.37 
§ 19 | 141 149 149 147 146 1.44 143 142 141 141 140 1.38 1.37 1.36 
Q 20 | 140 149 148 147 145 144 143 142 141 140 1.39 1.37 1.36 1.35 
2 21} 140 148 148 146 144 143 142 141 140 1.39 1.38 1.37 1.35 1.34 
2 22 ; 140 148 147 145 144 142 141 140 1.39 1.39 1.37 1.36 1.34 1.33 
o 23 | 1.39 1.47 147 145 143 142 141 1.40 1.39 1.38 1.37 1.35 1.34 1.33 
H 24 | 1.39 147 146 144 143 141 140 1.39 1.38 1.38 1.36 1.35 1.33 1.32 
a 25 | 1.39 147 146 144 142 141 140 1.39 1.38 1.37 1.36 1.34 1.33 1.32 
26 146 1.45 1.44 1.42 1.41 1.39 1.38 1.37 1.37 1.35 1.34 1.32 1.31 
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Cramer’s Rule 


Cramer’s Rule: Cramer’s rule can be used to solve a system of simultaneous linear algebraic equations. 


Consider a general system of n linear equations in n unknowns: 


Ged kbd x = b, 


lion 


a,x, ‘f a,x, cag ah a,x, = b, 


Ore Ge ot ax =b, (1.1) 


mn 


and define the determinant of such a system. We begin by defining the determinant of a 3 x 3 system: 


4, A 43 


a a heme | et eame 
- = 22 43 21 43 21 42 
C=|4,, Gy a3\/=a, A 


(1.2) 


a. 33 a3) 33 


a3, Ay 
43; 437 G33 


The general definition of the determinant of the n x n system of Equation (1.1) is simply an extension of 
the procedure (1.2). 


Qi &Q ++ Ay 
G2; 422 +++ Ann +1 

C=|. : = eat at CL)" a,,A,, (1.3) 
ant (enye oon (hry 


where AV is the (n — 1) x (n — 1) determinant obtained by crossing out the first row and ;" column of the 
original n x n determinant. Hence an n x | determinant can be obtained by calculating n(n — 1)(n — 1) 
determinants. 


We shall conclude this appendix by introducing a method for obtaining solutions of the system (1.1). 
We define the determinants 


Bb a2 ° Ay 


by Gy °° Ayn, 
C= : : ayaa 
b, a2 Ann 
a, B a3 Qn 
ay, b, ay ee 
C,= : 
An] b,, Qn3 °°’ Ann 
| 
Ay Ar 7 Ape Bo Ag Gn) 
Qn, on °° Gap By Angyy Ayn| : 
CG. lc : c 2 g : | 
Any Ang °° An k-l De Angst °° Ann 
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ay ap are 

Ay, Gy, °° Ann by p (1.4) 
C, =|: : : : 

an} an? eae Qn n-1 b, 


b, 


Then we have the following theorem, known as Cramer's rule. 


Cramer’s rule: Let C and C,, k = 1. 2, ..., n, be given as in (1.3). If C # 0, then the unique solution to the 
system (1.1) is given by the values 


Cee L5 
Ec? 7 ny (5) 


Example El.1 


Obtain the solution of the following simultaneous linear equations by Cramer’s rule. 


» [Bg 


fb 3222 |x, ls 
(b) 3. elec 
4 oie 2 
SOLUTION: 
_1Gil 
(a) Ae 
|C| 
12 -ll -5~36 
x, = =, = 
Seite. 24 ae 
4-1 
4 12' 12-20 
Xx) = = —— = 0.62 


(b) pel 
IC| 
a 
5 4 
2 2 =O) —44] 
gle a aren) 
3) 2 —63 
=) | 
42-9 
iS 
Be rl 
a eo Ja 
ae id 3 2) -63 
ot a | 
aao 2 =9 
ames 
Sa Ane) 
Blst 2 | 252 
Pe Ss 
3 4 1 
oo 
Example El.2 
Consider the system 
2X, 4k =k ——) 
Agee 3x, + Sx, = 14 
6x, + 3x, — 2x, =5 
We have 
2 4 -1 i weiter 
C=|(4 3 5)=112 C=|14 3 5)|=224 
6 -3 -2 2 i ae” 
2 -5 -1 2 4 -5 
C,=|4 14 5)/=-112 C3; =|-4 3 14/=560 
62 (Sarees 
Cy _ oh 
Therefore x, =— =2, ea hg = 
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|PROBLEMS | 


Solve the following system of linear equations using Cramer’s rule. 


PI.1(a) x+2y+z=0 
3x+ y—z=0 
x-y+4z=3 

(b) 2x+y—3z=11 
4x —2y+3z=8 
2x + 2y-z=-6 

(c) x+3y-z=-10 


—x+4y+2z=-4 

2x —2y + 5z= 35 

(d) y-3z=5 
2x+3y-—z=7 

4y+5y—22= 10 


iy tl ii 2 
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Glossary of Technical Terms 


Terminology used frequently in the field of Probability and Statistics is compiled here from various sources. 


t-Distribution: A continuous distribution with a specific type of bell-shaped curve with its mean equal to 0 


and standard deviation equal to ./df /(df — 2). 


z-Value or z-Score: The units of the standard normal distribution that are denoted by z. 


B-Beta: The probability of not rejecting a null hypothesis when it actually is false. (The probability of 
committing a Type II error). 


o (Sigma): The significance level of a test of hypothesis that denotes the probability of rejecting a null 
hypothesis when it actually is true. (The probability of committing a Type I error). 


a-Alpha: The probability that the statistical test will incorrectly reject the null hypothesis. 

.,: The mean of the paired differences for the population. 

o,: The standard deviation of the paired differences for the population. 

d: The difference between two matched values in two samples collected from the same source. It is called 
the paired difference. 


d : The mean of the paired differences for a sample. 


Absolute Dispersion: The actual variation or dispersion is called the absolute dispersion. 


Absolute Error:The difference, without sign, between the actual value and a statistically determined model 
value. 


all those value that would not be rejected with p =.05. 
Alpha: The probability that the statistical test will incorrectly reject the null hypothesis. 


Alternative Hypothesis: A claim about a population parameter that will be true if the null hypothesis is 
false. 


Analysis of Variance (ANOVA): A statistical technique used to test whether the means of three or more 
populations are equal. 


ANOVA: See analysis of variance. 
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Array: An array is an arrangement of raw numerical data in ascending or descending order of magnitude. 
Average: Another name for the mean. The sum of all the data values divided by the number of data values. 
Bar Graph: A graph made of bars whose heights represent the frequencies of respective categories. 
Bernoulli Trial: Consider a process that consists of a sequence of n independent trials, where the outcome 
of each trial is a success or a failure. These are called Bernoulli trials. 

Beta: The probability of making a Type II error when making a statistical hypothesis test. One minus this 
probability is power, the probability of correctly rejecting the null hypothesis. 

Biased Estimator: See unbiased estimator. 

Bimodal Distribution: A distribution that has two modes. 

Binomial Experiment: An experiment that contains n identical trials such that each of these 7 trials has 


only two possible outcomes, the probabilities of these two outcomes remain constant for each trial. and the 
trials are independent. 


Binomial Parameters: The total trails n and the probability of success p for the binomial probability 
distribution. 

Binomial Probability Distribution: The probability distribution that gives the probability of x successes in 
n trials when the probability of success is p for each trial of a binomial experiment. 

Box Plot: The box represents the middle 50% of the data values; that is, the data values from the first to 
third quartiles. The line or dot in the middle of the box denotes the median. Whiskers — lines extending from 
the box — extend either to the limits of the data or to a maximum length equal to 1.5 times the length of the 
box. In the latter case, points beyond the lines indicate outliers. 


Box-and-Whisker Plot: A plot that shows the center, spread and skewness of a data set by drawing a box 
and two whiskers using the median, the first quartile, the third quartile, and the smallest and the largest 
values in the data set between the lower and the upper inner fences. 


Categorical Variable: The values of a categorical variable are discrete categories. Usually the categories 
cannot be ordered from highest to lowest in any meaningful way. Rather, the categories just define different 
groups. Gender and religions are examples of categorical variables. Compare continuous variable. See also 


variable. 

Census: A survey conducted by including every element of the population. 

Central Limit Theorem: The theorem from which it is inferred that for a large sample size (n 2 30), the 
shape of the sampling distribution of x is approximately normal. Also, by the same theorem, the shape of 


the sampling distribution of p is approximately normal for a sample for which np > 5 and ng > 5. 
Chebyshev’s Theorem: For any number x greater than 1, at least (1 — 1/A°) of the values for any distribution 
lie within k standard deviations of the mean. 

chi-square Distribution: A distribution, with degrees of freedom as the only parameter, that is skewed to 
the right for small df and looks like a normal curve for large df. 

chi-square Test: A statistical test for determining whether two categorical variables are related. 

Class Boundary: The midpoint of the upper limit of one class and the lower limit of the next class. 


Class Frequency: The number of values in a data set that belongs to a certain class. 
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Class Length: (See class size) 

Class Midpoint or Mark: The class midpoint or mark is obtained by dividing the sum of the lower and 
upper limits (or boundaries) of a class by 2. 

Class Size: The size or width of a class interval is the difference between the lower and upper class 
boundaries. It is also referred to as the class width, class size, or class length. 

Class: An interval that includes all the values in a (quantitative) data set that fall within two numbers, the 
lower and upper limits of the class. 

Classical Probability Rule: The method of assigning probabilities to outcomes or events of an experiment 
with equally likely outcomes. 

Cluster Sampling: The sampling technique under which the population is divided into clusters and a sample 
is chosen from one or a few clusters. 

Cluster: A subgroup (usually geographical) of the population that is representative of the population. 
Coefficient of Determination: A measure that gives the proportion (or percentage) of the total variation in 
a dependent variable that is explained by a given independent variable. 

Coefficient of Variation: A measure of relative variability that expresses standard deviation as percentage 
of the mean. 

Combinations: The number of ways x elements can be selected from n elements. 

Complementary Events: Two events that are taken together include all the outcomes for an experiment but 
do not contain any common outcomes. 

Compound Event: An event that contains more than one outcome of an experiment. It is also called a 
composite event. 

Conditional Probability: The probability of an event subject to the condition that another event has already 
occurred. 

Confidence Interval: All the values for a parameter that would not be rejected if they were used as the null 
hypothesis in a statistical test of that parameter. A 95% confidence interval includes. 


Confidence Level: Confidence level, denoted by (1 — «) 100%, states how much confidence we have that a 
confidence interval contains the true population parameter. This may be thought of as the probability that 
a given assertion is true. More precisely a statement is true at C confidence level if it is true for fraction C 
of all notional populations that give the same sampling result as the one actually observed. 


Confidence Limit: Stated at C confidence level, the C confidence limits are the endpoints of the C confidence 
interval. 


Consistent Estimator: A sample statistic with a standard deviation that decreases as the sample size increases. 
Contingency Coefficient: Quantifies the degree of relationship between two categorical variables. 


Continuity Correction Factor: Addition of .5 and/or subtraction of .5 from the value(s) of x when the normal 
distribution is used as an approximation to the binomial distribution, where x is the number of successes in 
n trials. 

Continuous Data: Data, which can be described by a continuous variable, are called continuous data. 


Continuous Random Variable: A random variable that can assume any value in one or more intervals. A 
random variable whose values are countable. 
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Continuous Variable: (see variable) A variable, which can theoretically assume any value between two 
given values, is called a continuous variable, otherwise it is called a discrete variable. 


Control Group: The group on which no condition is imposed. 

Controllable Factor: A design variable that is considered to influence the response and is included in the 
experiment. The experimenters can control its level. 

Convenient Sample: A sample in which the most accessible members of the population are selected. 


Covariance: Assesses the degree to which two variables co-vary or vary together. If the two variables are 


independent then the covariance will equal zero. It is computed as the mean of the products of the mean 
deviations for each variable. 


Critical Value or Critical Point: One or two values that divide the whole region under the sampling 
distribution of s sample statistic into rejection and non-rejection regions. 

Cross-Section Data: Data collected on different elements at the same point in time or for the same period of 
time. 

Cumulative Frequency: The cumulative frequency of a class is the frequency of the class divided by the 
total frequency of all classes and is generally expressed as a percentage. 


Cumulative Frequency Distribution: The total frequency of all values less than the upper class boundary 
of a given class interval is called the cumulative frequency. A table representing such cumulative frequencies 
is called a cumulative frequency distribution, cumulative frequency table or simply a cumulative distribution. 
Cumulative Percentage: The cumulative relative frequency multiplied by 100. 

Cumulative Probability: The sum of the probabilities from the minimum possible value up to a certain value. 
Cumulative Probability Distribution: By cumulating probabilities we obtain cumulative probability 
distribution, which are analogous to cumulative relative frequency distributions. The function associated 
with this distribution is often called a distribution function. 


Cumulative Relative Frequency: The cumulative frequency of a class divided by the total number of 
observations. 

Curve Fitting: The general problem of finding equations of approximating curves, which fit gives sets of 
data, is called curve fitting. 

Data or Data Set: Collection of observations or measurements on a variable. 

Degrees of Freedom (df): The number of observations that can be chosen freely. For the estimation of 
using the ¢-distribution, the degrees of freedom are n — 1. 

Degrees of Freedom for a Simple Linear Regression Model: Sample size minus 2, that is, 7 — 2. 
Dependent Events: Two events for which the occurrence of one changes the probability of the occurrence 
of the other. 

Dependent Variable: The variable to be predicted or explained. 

Descriptive Statistics: Collection of methods that are used for organising, displaying and describing data 
using tables, graphs and summary measures. 

Designed Experiment: A study in which the experiment controls the assignment of elements to different 
treatment groups. 

Deterministic Model: A model in which the independent variable determines the dependent variable exactly. 
Such a model gives an exact relationship between two variables. 
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Discrete Data: Data, which can be described by a discrete variable, are called discrete data. 


Discrete Probability Distribution: Of a variation x can assume a discrete set of values with respective 
probabilities where we can say that a discrete probability distribution for x has been defined. 


Discrete Random Variable: A random variable whose values are countable. 
Discrete Variable: A variable that can have only certain discrete values. The same as a categorical variable. 


Dispersion: The degree to which numerical data tend to spread apart an average value is called the variation 
or dispersion of the data. The different measure of valuation or dispersion available is the range, mean 
deviation, semi-interquartile range, and the standard deviation. 


Distribution Function: A function giving, for every value x, the probability that a random variable X is 
lower than equal to x. 


Dot Plot: A simple graph in which each observation is represented by a dot at its score. 


Efficient Estimator: If the sampling distributors of the two statistics have the same mean (or expectation) 
the statistic with the smaller variance is called an efficient estimator of the mean while the other statistic is 
called an inefficient estimator. 


Element or Member: A specific subject or object included in a sample or population. 


Empirical Rule: For a specific bell-shaped distribution, about 68% of the observations fall in the interval 
(U — 6) to (u + 6), about 95% fall in the interval (u — 20) to (u + 20), and about 99.7% fall in the interval 
(u — 30) to (u + 30). 


Equally Likely Outcomes: Two (or more) outcomes or events that have the same probability of occurrence. 


Error: The amount of variation in the response caused by factors other than controllable factors included 
in the experiment. 


Estimate: The value of a sample statistic that is used to find the corresponding population parameter. 


Estimated or Predicted Value of y: The value of the dependent variable, denoted by } , that is calculated 
for a given value of x using the estimated regression model. 


Estimation: Estimation is the process of analysing a sample result in order to predict the corresponding 
value of the population parameter. The sample statistic that is used to estimate a population parameter. 


Estimator: The sample statistic that is used to estimate a population parameter. 
Event: A collection of one or more outcomes of an experiment. 


Expected Frequencies: The frequencies for different categories of a multinomial experiment or for different 
cells of a contingency table that are expected to occur when a given null hypothesis is true. 


Experiment: A process with well-defined outcomes that, when performed, results in one and only one of 
the outcomes per repetition. 


F-Distribution: A probability distribution for ratios of variances. Most commonly used in analysis of 
variance to compare the ratio of the variance of the means from a number of groups were the same. 
Factorial n: Factorial 1, denoted by x! is simply defined as n! = n (n — 1) (n—2)-* 1. 

Factorial: Denoted by the symbol “!” 


First Quartile: The value in a ranked data set such that about 25% of the measurements are smaller than 


this value and about 75% are larger. It is the median of the values that are smaller than the median of the 
whole data set. 
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Frequency Distribution: A tabular arrangement of data by classes put together with the corresponding 
class frequencies is called a frequency distribution or frequency table. 


Frequency Function: (See probability function). 


Frequency Polygon: A frequency polygon is a line graph made by connecting the mid points of the bars in 
a histogram by straight lines. 


Frequency Table: (See frequency distribution) 

Frequency: The number of times a certain data value occurs in the set of observations. 

Gaussian Distribution: Another name for the normal distribution. 

Goodness-of-Fit Test: A test of the null hypothesis that the observed frequencies for an experiment follow 
a certain pattern or theoretical distribution. 

Grouped Data: Data organised and summarised in a frequency-distributed manner are called grouped data. 
Histogram: A graph in which classes are marked on the horizontal axis and frequencies, relative frequencies, 
or percentages are marked on the vertical axis. Bars that are drawn adjacent to each other represent the 
frequencies, relative frequencies, or percentages or various classes. 

Hypothesis Test: A test of whether or not the null hypothesis should be rejected based on the data. 
Hypothesis: A hypothesis is an assertion made about a population. 

Impossible Event: An event that cannot occur. 

Independent Events: Two events for which the occurrence of one does not change the probability of 
occurrence of the other. 

Independent or Expianatory Variable: The variable included in a model to explain the variation in the 
dependent variable. 

Independent Samples: Two samples drawn from two populations such that the selection of one does not 
affect the selection of the other. 

Independent Variable: In an experiment, the levels of independent variables are controlled by the 
experimenter. Independent variables serve as inputs. Compare dependent variable. 

Inefficient Estimator: See efficient estimator. 

Inferential Statistics: Collection of methods that help make decisions about a population based on sample 
results. 

Interaction: Two factors are said to have interaction with each other if the influence of one on the response 
function is dependent on the value of the other. 

Intercept: In the following equation for a line such as a regression model, the parameter a is the intercept. 
y-intercept: The point at which the regression line intersects the vertical axis on which the dependent 
variable is marked. It is the value of y when x is zero. 

Interquartile Range (IQR): The difference between the third Quartile (Q3) and the first quartile (Q1). The 
IQR indicates the range of scores spanned n by the middle half of the data values. The box in a box plot 
represents the IQR. 

Intersection of Events: Intersection of events is given by the outcomes that are common to two (or more) 
events. 

Interval Estimate: An estimate of a population parameter given by two numbers between which the parameter 
may be considered to lie is called an interval estimate of the parameter. 
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Joint Probability: The probability that two (or more) events occur together. 


Judgment Sample: A sample that includes the elements of the population selected based on the judgment 
and prior knowledge of an expert. 


Kruskal-Wallis Test: A non-parametric statistical procedure for comparing medians across groups. 
Kurtosis: Kurtosis is the degree of peaked ness of a distribution relative to a normal distribution. 
Large Sample: The samples of size greater than 30 are called large samples. 


Law of Large Numbers: If an experiment is repeated again and again, the probability of an event obtained 
from the relative frequency approaches the actual or theoretical probability. 


Least-Squares Estimates of a and b: The values of a and b that are calculated by using the sample data. 


Least-Squares Method: The method used to fit a regression line through a scatter diagram such that the 
error sum of squares is a minimum. 


Least-Squares Regression Line: A regression line obtained by using the least squares method. 
Left-Tailed Test: A test in which the rejection region lies in the left tail of the distribution curve. 


Level of Significance: The maximum probability in testing a hypothesis is with which type I error is 
considered to be in risk is called the level of significance of the test. 


Linear Correlation Coefficient: A measure of the strength of the linear relationship between two variables. 
Linear Regression Model: A regression model that gives a straight-line relationship between two variables. 
Lower Inner Fence: The value in a data set that is 1.5 x IQR below the first quartile. 
Lower Outer Fence: The value in a data set that is 3.0 x IQR below the first quartile. 


Lower Quartile: The lowest 25% of the scores are equal to or less than the lower quartile score. Often 
represented as Q1; equal to the 25" percentile. 


Marginal Probability: The probability of one event or characteristic without consideration of any other 
event. 


Maximum Error of Estimate: The quantity that is subtracted from and added to the value of a sample 
statistic to obtain a confidence interval for the corresponding population parameter. 


Maximum: The largest data value in a set of observations. 


Mean of a Discrete Random Variable: The mean of a discrete random variable x is the value that is expected 
to occur per repetition, on average, if an experiment is performed a large number of times. The mean of a 
discrete random variable is also called its expected value. 


Mean of p: The mean of the sampling distribution of x , denoted x, is equal to the population mean 4. 


Mean Square Between Samples or MSB: A measure of the variation among means of samples taken from 
different populations. 


Mean Square Within Samples or MSW: A measure of the variation within data of all samples taken from 
different populations. 


Mean: The average of the data values, that is, the sum of all the data values divided by the number of data 
values. The mean makes the sum of squared errors as small as possible. 


Measure of Dispersion: Measures that give the spread of a distribution. The range, variance and standard 
deviation are three such measures. 
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Measures of Central Tendency: Measures that describe the centre of a distribution. The mean, median and 
mode are three of the measures of central tendency. 


Measures of Position: Measures that determine the position of a single value in relation to other values in 
a data set. Quartiles, percentiles, and percentile rank are examples of measures of position. 


Median Absolute Deviation (MAD): The median of all the absolute errors or deviations. The MAD describes 
the spread of the data values away from the model value. 


Median: The middle data value in a set of observation. To find the median, reorder the data from smallest to 
largest and find the middle observation; that is the median. If there is an even number of observations, 


then there will be two middle values; in that case, the average of those two middle values is the median. 
The median is also the SO" percentile. 


Minimum: The lowest data value in a set of observations. 
Mode: A value (or values) that occurs with highest frequency in a data set. 


Model: Specifies a predicted or representative value of reach observation in a data set. Simple models are 
the mean or the median for a set of observations. More complex models are. for example. regression lines. 


Multimodal Distribution: A distribution that has more than two modes. 


Multinomial Experiment: An experiment with n trials for which (1) the trials are identical, (2) there are more 
than two possible outcomes per trial, (3) the trials are independent and (4) the probabilities of various 
outcomes remain constant for each trial. 

Multiple Regression Model: A regression model that contains two or more independent variables. 
Mutually Exclusive Events: Two or more events that do not contain any common outcome and, hence, 
cannot occur together. 

Negative Relationship between Two Variables: The value of the slope in the regression line and the correlation 
coefficient between two variables are both negative. 

Nominal Variable: Same as categorical variable. Levels of these variables are noms or names. For example, 
the names for the levels of the gender variable are male and female. 

Non-linear (simple) Regression Model: A regression model that does not give a straight-line relationship 
between two variables. 

Non-parametric Statistics: Statistical methods that make no assumption that the data are from a normal 
distribution. 

Non-random Sample: A sample in which some members of the population may have no chance of being 
selected. 

Non-sampling Errors: The errors that occur during the collection, recording, and tabulation of data. 
Non-sampling or Systematic Error: The errors that occur in the collection, recording and tabulation of 
data. 

Normal Distribution: The probability distribution that describes or approximates the distribution of many 
variables. Most observations in a normal distribution occur near the mean. 


Normal Probability Distribution: The probability distribution of a continuous random variable that, when 
plotted, gives a specific bell-shaped curve. The parameters of the normal distribution are the mean wt and 


the standard deviation o. 


454 // Probability and Statistics for Scientists and Engineers // 


Null Hypothesis: The hypothesis that nothing is happening in the data that would not be expected by 
chance. Some null hypotheses are (a) the means of two groups are equal, (b) the slope in a simple regression 
is zero, (c) the correlation between two variables is zero, and (d) the classification of the data according to 
one nominal variable is independent of the classification of that data according to a second nominal variable. 


Number of Degrees of Freedom: The number of degrees of freedom (7) of a statistic generally is defined as 
the number N of the independent observatory in the sample (sample size) minus the number k of population 
parameters estimated from the sample observations. 


Observation or Measurement: The value of a variable for an element. 


Observation Study: A study in which the assignment of elements to different treatment is voluntary and the 
researcher simply observes the results of the study. 


Observation: A person, place, or thing for which a data value is available or is measured. 

Observed Frequencies: The frequencies actually obtained from the performance of an experiment. 
Observed Value of z or f: The value z or f calculated for a sample statistic such as the sample mean or the 
sample proportion. 

Ogive: A curve drawn for a cumulative frequency distribution. 

One-Way ANOVA: The analysis of variance technique that analyses one variable only. 


One-Tailed Test: A test in which there is only one rejection region, either in the left tail or in the right tail of 
the distribution curve. 


Ordinal Variable: A variable whose values indicate only rank and not the distance between values. For 
example, in a race, an ordinal variable would indicate who was first, second, third, etc., but would not give 
the time or distance between the racers. 


Outcome: An outcome is one particular result of an experiment. 
Outlier: An extreme or a typical value that is very different from other data values in the set of observations. 


Outliers or Extreme Values: Values that is very small or very large relative to the majority of the values in 
a data set. 


Paired or Matched Samples: Two samples drawn in such a way that they include the same elements and 
two data values are obtained from each element, one for each sample. Also called dependent samples. 


Parameter: An unknown value in a model that is estimated from the data. Examples: mean, median, intercept, 
and slope. 


Percentage: The percentage for a class or category is obtained by multiplying the relative frequency of 
that class or category by 100. 


Percentile Rank: The percentile rank of a value gives the percentage of values in the data set that are 
smaller than this value. 


Percentile: The P" percentile data value is the score that is equal to or greater than P% of all the data 
values. For example, the score equal to the 25" percentile is equal to or greater than one quarter of all the 
data values (and is also the first quartile). The 50" percentile necessarily is the median. Ninety-nine values 
that divide a ranked data set into 100 equal parts. 

Permutations: A permutation of n different objects taken r at a time is an arrangement of r out of the n 
objects with no special attention given to the order of arrangement. The number of combinations of n objects 


n! 


taken r at a time is denoted by np, and is given by np, = ———— 
" (n—r)! 
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Pie Chart: A circle divided into portions that represent the relative frequencies or percentages of different 
categories or classes. 


Point Estimate: An estimate of a population parameter given by a single number is called a point estimate 
of the parameter. 


Poisson Parameter: The average occurrences, denoted by A, during an interval for a Poisson probability 
distribution. 


Poisson Probability Distribution: The probability distribution that gives the probability of x occurrences 
in an interval when the average occurrences in that interval are A. 


Polygon: A graph formed by joining the midpoints of the tops of successive bars in a histogram by straight 
lines. 

Population Distribution: The probability distribution of the population data. 

Population or Target Population: The collection of all elements whose characteristics are being studied. 
Population Parameters for a Simple Regression Model: The values of A and B for the regression model 
y=A+Bx+e that are obtained by using population data. 


Population Parameters: The estimation of unknown population quantities (such as population mean, variance, 
etc.) often called population parameters from a knowledge of corresponding sample quantities (sample mean 
variance, etc.), often called sample statistics. 

Population Proportion p: The ratio of the number of elements in a population with a specific characteristic 
to the total number of elements in the population. 

Population: A population consists of all the individuals or objects in a well-defined group about which 
information is desired to answer a question. 

Positive Relationship between Two Variables: The value of the slope in the regression line and the 
correlation coefficient between two variables are both positive. 

Power: The probability that a statistical test will reject the null hypothesis given that some specific alternative 
model is true. Equivalently, the probability that a Type IJ error will be avoided. 

Prediction Interval: The confidence interval for a particular value of v for a given value of x. 

Probabilistic or Statistical Model: A model in which the independent variable does not determine the 
dependent variable exactly. 

Probability: The probability of an event occurring in an experiment equals the proportion of times the event 
occurs if the experiment is repeated a very large (infinite) number of times. 

Probability Distribution of a Discrete Random Variable: A list of all the possible values that a discrete 
random variable can assume and their corresponding probabilities. 

Probability Distribution: A distribution that determines the probability that a random variable takes any 
given value or belongs to a given set of values. 

Qualitative or Categorical Variable: A variable that cannot assume numerical values but is classified into 
two or more categories. 

Quantitative Data: Data generated by a quantitative variable. 

Quartiles: Summary measures that divide all the scores into four quarters. One-fourth of the data values 


are equal to or less than the first or lower quartile (Q1), one half of the values are equal to or less than the 
second quartile (Q2, also known as the median), and three fourths of all the data values are equal to or less 


than the third or upper quartile (Q3). 
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Quota Sample: A sample selected in such a way that each group or subpopulation is represented in the 
sample in exactly the same proportion as in the target population. 


Random Error Term (€): The difference between the actual and predicted value of y. 


Random Sample: A sample drawn in such a way that each element of the population has same chance of 
being included in the sample. 


Random Sampling: Each member of a population, which has an equal chance of being included, is called 
random sampling. 


Random Variable: A variable, denoted by x, whose value is determined by the outcome of a random 
experiment. Also called a chance variable. 


Randomisation: The procedure under which elements are assigned to different (treatment and control) groups 
at random. 


Range: The difference between the largest and smallest numbers is called the range of the data. 
Ranks: Data values, which indicate only the order of the observations. See ordinal variable. 
Raw Data: Data recorded in the sequence in which they are collected and before they are processed. 


Relative Cumulative Frequency: The relative cumulative frequency or percentage cumulative frequency is 
the cumulative frequency divided by the total frequency. 


Relative Dispersion: Relative dispersion is defined as the ratio of absolute dispersion and the average 
value. 


Relative Frequency: The frequency of a class or category divided by the sum of all frequencies. 
Representative Sample: A sample that contains the characteristics of the corresponding population. 
Right-Tailed Test: A test in which the rejection region lies in the right rail of the distribution curve. 
Rules of Decision: (See Tests of Significance). 


Sample: A selection of observations from a larger population of all possible observations. Ideally, the sample 
is selected randomly from the population. 


Sample Point: An outcome of an experiment. 


Sample Proportion p: The ratio of the number of elements in a sample with a specific characteristic to the 


total number of elements in that sample. 


Sample Space: A sample space is the set of all possible outcomes of an experiment. These outcomes are 
also called the elements of the sample space. 


Sample Statistics: See population parameters. 
Sample Survey: A survey that includes elements of a sample. 


Sampling Distribution: The probability distribution of the statistic one would get by sampling over and 
over again. 


Sampling Distribution of x : The probability distribution of all the values of ¥ calculated from all possible 
samples of the same size selected from a population. 


Sampling Distribution of p: The probability distribution of all the values of p> calculated from all possible 
samples of the same size selected from a population. 
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Sampling Error: The difference between the values of a sample statistic calculated from a random sample 
and the value of the corresponding population parameter. This type of error occurs due to chance. 


Sampling Frame: The list of elements of the target population that is used to select a sample. 


Sampling or Chance Error: The difference between the results obtained from a sample survey and the 
result that would be obtained from the census. 


Scatter Diagram or Scattergram: A plot of the paired observations of x and y. 


Scatter Plot, Scattergram: A two-variable plot in which each point represents the levels of an observation 
on each variable. Often the variable on the horizontal axis is to be used to predict the variable on the 
vertical axis. Scatter plots are useful for viewing correlations and for simple regression. 


z-Score Formula: The difference between the observation and the mean, divided by the standard deviation. 
S; The standard deviation of the paired differences for a sample. 
Second Quartile: Middle or second of the three quartiles that divide a ranked data set into four equal 


parts. About 50% of the values in the data set are smaller and about 50% are larger than the second quartile. 
The second quartile is the same as the median. 


Selection Error: The error that occurs because the sampling frame is not representative of the population. 
Significance Level: The pre-specified probability of a Type I error below which the null hypothesis is 
considered to be so rare as to be implausible. 


Significant: When the probability of obtaining a statistical value is below the significance level, the statistic 
is said to be statistically significant. 


Simple Event: An event that contains one and only one outcome of an experiment. It is also called an 
elementary event. 

Simple Linear Regression: A regression model with one dependent and one independent variable that 
assumes a straight-line relationship. 


Simple Random Sample: A sample chosen in such a way that each element of the population has the same 
probability of being included in the sample. 


Sixtieth Percentile: The sixtieth percentile is larger than 60% of the scores (and smaller than 40% ). 
Skewed: A skewed distribution is not symmetric. That is, it has different shapes on each side of the median. 
For example: 

Skewed-to-the-Left Histogram: A histogram with a longer tail on the left side. 

Skewed-to-the-Right Histogram: A histogram with a longer tail on the right side. 


Skewness: Skewness is the degree of asymmetry or departure symmetry of a distribution. Of the frequency 
curve of a distribution has a longer tail to the right of the central maximum than to the left, the distribution 
is said to be skewed to the right or 5 to have positive skewness. Of the reverse is true, it is called skewed to 


the left or to have negative skewness. 

Slope: The coefficient of x in a regression model that gives the change in y for a change of one unit in x. 
Small Samples: The samples of site lesser than 30 are called small samples. 

SSB: The sum of squares between samples. Also called the sum of squares of the factor or treatment. 

SSE (error sum of squares): The sum of the squared differences between the actual and predicted values 
of y. It is the portion of the SST that is not explained by the regression model. 
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SSR (regression sum of squares): The portion of the SST that is explained by the regression model. 


SST: The total sum of squares given by the sum of SSB and SSW. 
SST (total sum of squares): The sum of the squared differences between actual y value and y. 


SSW: The sum of squares within samples. Also called the sum of squares of errors. 


Standard Deviation: A measure of spread that is given by the positive square root of the variance. 


Standard Deviation of x: The standard deviation of the sampling distribution of x , denoted by o,, is 


equal to o/ Vn when n/N < .05. 
Standard Deviation of a Discrete Random Variable: A measure of spread for the probability distribution of 
a discrete random variable. 


Standard Deviation of Errors: A measure of spread for the random errors. 
Standard Deviation of p: The standard deviation of the sampling distribution of p, denoted by 95;, is 


equal to o/ Vn when n/N <.05. 
Standard Error: The standard deviation of the sampling distribution for any given statistic. 
Standard Error of Mean: The standard deviation of the sampling distribution of the mean. 


Standard Normal Distribution: The normal distribution with = 0 and o = |. The units of the standard 
normal distribution are denoted by z. 


Statistic: A summary measure calculated for sample data. 


Statistics: Statistics deals with scientific methods for collecting, organising, summarising, presenting and 
analysing data, as well as drawing valid conclusions and making reasonable decisions based on such 
analysis. 

Statistical Hypothesis: Assumptions or guesses made which may or may not be true, are called statistical 
hypotheses and in general are statements about the probability distributions of the populations. 


Stem-and-Leaf Display: A display of data in which each value is divided into two portions, a stem and a 
leaf. 


Stratified Random Sampling: The sampling technique under which the population is divided into different 
strata and a sample is chosen from each stratum. 


Stratum: A subgroup of the population whose members are identical with regard to the possession of a 
characteristic. 


Subjective Probability: The probability assigned to an event based on the information and judgment of a 
person. 


Sure Event: An event that is certain to occur. 
Survey: Collecting data on the elements of a population or sample. 


Symmetric: A symmetric distribution has the same shape on each side of the median. The normal distribution 
is an example of a symmetric distribution. Compare skewed. 


Symmetric Histogram: A histogram that is identical on both sides of its central point. 


Systematic Random Sampling: Sampling method used to choose a sample by selecting every Ath unit from 
the list. 
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Target Population: The collection of all subjects of interest. 


Target Value: A value that a product is expected to posses. Most often this value is different from what a 
single unit actually does exhibit. For a 9 volt transistor battery the target value is 9 volts. 


Test of Homogeneity: A test of the null hypothesis that the proportions of elements that belong to different 
groups in two (or more) populations are similar. 


Tests of Hypothesis: Procedures, which will enable to decide whether to accept or reject hypothesis or to 
determine whether observed samples differ significantly from expected results are called tests of hypothesis; 
tests of significance, or rules of decision. 


Test of Independence: A test of the null hypothesis that attributes of a population is not related. 


Test Statistic: The value of z or t calculated for a sample statistic such as the sample mean or the sample 
proportion. 


Tests of Significance: (see tests of significance) 


Third Quartile: Third of the three quartiles that divide a ranked data set into four equal parts. About 75% 
of the values in a data set are smaller than the value of the third quartile and about 25% are lager. It is the 
median of the values that are greater than the median of the whole data set. 


Tree Diagram: A diagram in which each outcome of an experiment is represented by a branch of a tree. 


Trimmed Mean: The k% trimmed mean is obtained by dropping k% of the smallest values and k% of the 
largest values from the given data and then calculating the mean of the remaining (100 — 2k)% of the values. 


Two-Tailed Test: A two-tailed test considers whether the null hypothesis might be wrong in either direction: 
that is, it allows the possibility that the true parameter might be either above or below the value of the null 
hypothesis. Compare one-tailed test. 


Type I Error: An error that occurs when a true null hypothesis is rejected. 

Type If Error: An error that occurs when a false null hypothesis is not rejected. 

Unbiased Estimator: Of the mean of the sampling distribution of a statistic equals to the corresponding 
population parameters, the statistic is called an unbiased estimator of the parameters, and otherwise it is 
called a biased estimator. 

Uniform or Rectangular Histogram: A histogram with the same frequency for all classes. 

Unimodal Distribution: A distribution that has only one mode. 

Union of Two Events: All outcomes that belong either to one or to both events. 

Upper Inner Fence: The value in a data set that is 1.5 x IQR above the third quartile. 

Upper Outer Fence: The value in a data set that is 3.0 x IQR above the third quartile. 


Upper Quartile: The 75" percentile. The observation that is greater than or equal to 75% of the observations 
and less than 25% of the observations. Sometimes designated by Q.. 


p-Value: The smallest significance level at which a null hypothesis can be rejected. 


Variable: A characteristic, trait, attribute, or measurement that can take on different values. A variable must 
vary, having at least one value for some observations and another value for other observations. 


Variables, Factors, or Parameters: These words are used synonymously to indicate the controllable factors 
in an experiment. In the case of a plastic molding experiment, molding temperature, injection pressure, set 


time, etc., are factors. 
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Variance: The typical or average squared error. For a population, the sum of squared errors is divided by n 
the number of observations. More often, interest is in the sample estimate of the variance in which case the 
sum of squared errors is divided by n — 1. 


Variation: —See dispersion— 
Venn Diagram: A picture that represents a sample space or certain events. 


y-intercept: The point at which the regression line intersects the vertical axis on which the dependent 
variable is marked. It is the value of y when x is zero. 


0105 


Glossary of Symbols 


(1) y-intercept in the population regression model; (2) event A 


y-intercept in the regression model estimated for sample data 
Complement of event A 


(Greek letter alpha). Probability of Type I error or significance level 
Confidence level 

Slope of the population regression line 

Slope of the regression model estimated for sample data 

(Greek letter beta). Probability of Type II error 

Number of columns in a contingency table 

(x is Greek letter chi). chi-square distribution 

Difference between two paired values 

Mean of the paired differences for a sample 

Number of degrees of freedom 

(1) Maximum error of estimate; (2) Expected frequencies 

Simple event i for an experiment 

(1) Random error, y — y , for a regression model, (2) e = 2.71828 in the Poisson probability 
distribution 

Expected value of a random variable x 

(Greek letter epsilon). Random error term in the population regression model 
Frequency of a category or class 

The F-distribution 

Null hypothesis 
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A, Alternative hypothesis 
IQR Inter quartile range 
k (1) Number of different samples or treatments in a one-way ANOVA; (2) Number of categories 


in a multinomial experiment 
(Greek letter lambda). Mean of the Poisson probability distribution 
Midpoint of a class 


U (Greek letter mu). Population mean 

LL, Mean of the sampling distribution of b 

LL, Mean of the paired differences for the population 

Lj Mean of the sampling distribution of q 

i= Mean of the sampling distribution of x 

ble Mean of the sampling distribution of p 

H., Mean of y for a specific value of x using population regression model 


n Sample size 


n! n factorial 
i 
k Number of combinations of n elements selected x at a time 
N Population size 
O Observed frequencies for a category or cell 
p (1) Population proportion; (2) Probability of success for a binomial experiment 


p Sample proportion 


Pooled sample proportion for two samples 
i kth percentile, where k is an integer from 1 to 99 
P(A) Probability that event A will occur 
P(AandB) — Joint probability of events A and B 
P (A or B) Probability of the union of events A and B 


P(A|B) Probability that event A will occur given event B has occurred 
q | — p, probability of failure for a binomial experiment 
q | — p,where p is the sample proportion 


| — p, where p is the pooled sample proportion for two samples 


First, second, and third quartiles, respectively 


ado 
'o 
© 


Number of rows in a contingency table 
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r Linear correlation coefficient for sample data 
ea Coefficient of determination 
p (Greek letter rho). Linear correlation coefficient for population data 
S Sample space 
s Sample standard deviation 
ie Sample variance 
S, Estimator of 0, 
S, Standard deviation of the paired differences for a sample 
SZ Estimator of 07 
S, Standard deviation of errors for the sample regression model 
5, Pooled standard deviation 
Sp Estimator of ©; 
Sip, Estimator of Of, 
Sz Estimator of o, 
Sz_z, Estimator of 9; _< 
S» Estimator of 9). 
eS, Estimator of 9%), 
»y (Greek letter capital sigma). Summation notation 
Oo (Greek letter lower-case sigma). Population standard deviation 
oO Population variance 
G, Standard deviation of the sampling distribution of b 
0, Standard deviation of the paired differences for the population 
OF Standard deviation of the sampling distribution of qd 
Oo. Standard deviation of errors for the population regression model 
Oo; Standard deviation of the sampling distribution of p 
O5-A, Standard deviation of the sampling distribution of p, — p, 
om Standard deviation of the sampling distribution of x 
4%, Standard deviation of the sampling distribution of x, — x, 


Standard deviation of ) when estimating [,,, 
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O> Standard deviation of ) when predicting y, 


t The ¢-distribution 
7. Sum of the values included in sample i in one-way ANOVA 
x 


(1) Variable; (2) Random variable; (3) Independent variable in a regression model 


as Sample mean 

y (1) Variable; (2) Dependent variable in a regression model 
y Estimated or predicted value of y using a regression model 
z Units of the standard normal distribution 


0095 


Key Formulae 


Notation: The following notation is used. 


3 


= sample size 

= sample mean 

= sample std. dev. 

= j* quartile 
population size 

= population mean 

= population std. dev. 


&a AF 20 7 
| 


= paired difference 
= sample proportion 
= population proportion 


observed frequency 


es) Qe 8 laws) 
{| 


= expected frequency 


Chapter 1: Numerical Descriptive Measures 
~ Relative frequency of a class = f/2f 
“ Percentage of a class = (relative frequency) x 100 
“+ Class midpoint or mark = (upper limit + lower limit)/2 
“ Class width = upper boundary — lower boundary 


cumulative frequency 


oo (Cin evanze il hye lige ll =| === =a He 
: q y total observations in the data set 


“ Cumulative percentage = (cumulative relative frequency) x 100 
“ Mean for ungrouped data: pp = Xx/N and X = Xx/n 
“ Mean for grouped data: p = Zmf/N and x = Xmf/n 
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where it is the midpoint and fis the frequency of a class 
n+l ; 
¢ Median for ungrouped data = value of the (a2) th term in a ranked data set 


“ Range = largest value — smallest value 

Variance for ungrouped data 

_ Gx" 
n 

ad 

where 6” is the population variance and s’ is the sample variance 


52 ay 
o? = —__‘_ and 5? = 


pie 


i? 
bd 


Standard deviation for ungrouped data 


where 6 and s are the population and sample standard deviations, respectively 


“+ Variance for grouped data 


_ mf y m2 ¢ my” 
N 2 a 


=m? £ 
oy = Natl Ge 
N n 


“+ Standard deviation for grouped data 


e, 
we 


Sample standard deviation 


s= jails or s= eet 
n-1l n-1 
aye 


Population mean (mean of a variable: 1 = —+) 


K? 
bd 


“> Population standard deviation (standard deviation of a variable) 


SC, —py? [a2 

(oy = 2(%; —W)" or Co= ee 
N N 
Tt 


f 3 ete 
¢+ Standardised variable: z = 


Go 
“* Chebyshev’s theorem 


For any number k greater than 1, at least (1 — 1/k’) of the values for any distribution lie within k 
standard deviations of the mean. 
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KO 
~~ 


0, 
Ww 


Empirical rule 


For a specific bell-shaped distribution, about 68% of the observations fall in the interval (uu — G) to 
(u + 6), about 95% fall in the interval (U — 26) to (u + 20), and about 99.7% fall in the interval 


(U— 30) to (u+ 30). 

Interquartile range: IQR = Cc. 

where Q, is the third quartile and Q, is the first quartile 
Lower limit = Q, — 1.5 IQR 

Upper limit = Q, + 1.5 IQR 

The k" percentile 


kn 
an = value of the (= th term in a ranked data set 


; number of values less than x, 
Percentile rank of x, = ——@_———__——$___-+__$£— 
‘ — total number of values in the data set 


Chapter 2: Probability 


2 
“~ 


ae 


¢ 


Ke 
~W 


2 
we 


Classical probability rule for a simple event 
1 
P(E; ) = ———_—————— 
total number of outcomes 
Classical probability rule for a compound event 


number of outcomes in A 
P(A) = ——— 
total number of outcomes 


Relative frequency as an approximation of probability 


P(A) = Lf 
n 
Conditional probability of an event 
P(A and B) P(A and B) 
P(A| B) = ————— _ and P(B| A) = ————. 
(AB) Der 


Condition for independence of events 
P(A) = P(A|B) and/or P(B) = P(B| A) 

For complementary events: P(A) + P(A) =1 
Multiplication rule for dependent events 

P(A and B) = P(A) P(B|A) 
Multiplication rule for independent events 

P(A and B) = P(A) P(B) 
Joint probability of two mutually exclusive events 

P(A and B) =0 
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io, 
Ww 


, 
bd 


2 
bd 


Addition rule for mutually nonexclusive events 
P(A or B) = P(A) + P(B) — P(A and B) 
Addition rule for mutually exclusive events 
P(A or B) = P(A) + P(B) 
Special addition rule 
P(A or B or C or...) = P(A) + P(B) + P(C) +» (A, B, C,... mutually exclusive) 
Complementation rule: P(E) = 1 — P (not E) 
General addition rule: P(A or B) = P(A) + P(B) — P(A and B) 
General multiplication rule: P(A and B) = P(A)-P(BIA) 
Special multiplication rule: P(A and B and C and ...) = P(A):P(B)-P(C)::: (A, B, C, ... independent) 
Rule of total probability 


ik 
P(B) = 9) P(A;)-P(B| A;) 


j=l 
(A,, A,, ..., A, mutually exclusive and exhaustive) 
Bayes’s rule 

P(A,)-P(B|A;) 

k 
>» jr P(Aj)-PCB | A;) 
(A,, A,, ..., A, mutually exclusive and exhaustive) 
Factorial: k! =k(kK—1)-:- 2-1 


P(A, | B)= 


m! 


Permutations rule: mP. = 
(m-r)! 


Special permutations rule: mP_ = m! 


! 
Combinations rule: mC, = mete 
r\(m-r)! 
N! 
Number of possible samples: NC,, = ————— 
n\(N —n)! 


Chapter 3: Random Variables and Probability Distributions 


o, 
“ 


o, 
“~ 


¢, 


Oo 


Mean of a discrete random variable X: tb = XxP(X = x) 
Standard deviation of a discrete random variable X 


O=JX(x-Wy P(X =x) of O= VEX P(X = x)-p? 


Factorial: k! =k(k-—1)-:-2-1 


F ; - n n ! 
Binomial coefficient: ———— 
xX} x\(n-x)! 
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** Binomial probability formula 


a) — | p ilpy 


where, n denotes the number of trials and p denotes the success probability. 
Mean of a binomial random variable: 1 = np 


% 
“~ 


“> Standard deviation of a binomial random variable: o = np(1— p) 


x 
“ Poisson probability formula: P(X = x)= e~* x 
Xt 
“* Mean of a Poisson random variable: 1 =A 
“> Standard deviation of a Poisson random variable: g = AY 
Discrete Random Variables and their Probability Distributions 


“ Mean of a discrete random variable x: pp = XxP(x) 


“+ Standard deviation of a discrete random variable x: o = 4 Xx P(x) — pw 
“+ n factorial: n! = n(n —1)(n—2) + 3-2-1 
¢* Number of combinations of n items selected x at a time 


ny n!} 
x) x'(n—x)! 


¢+ Binomial probability formula 


n —Xx 
P(x) = pq 
x 
“ Mean and standard deviation of the binomial distribution 


U=npand o=./npq 
¢ Poisson probability formula 


ie 


x! 


P(x) = 
“ Mean, variance, and standard deviation of the Poisson probability distribution 


WeaA,o = and a=) 


Continuous Random Variables and the Normal Distribution 


¢ z value for an x value 


4 Value of x when i, o, and z are known: x = t+ zo 
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Chapter 4: Sampling Distributions 
“~ Meanof x: HW; =H 
“ Standard deviation of x when n/N < 0.05: 6, =0/ vn 


“ zvalue for x 


“+ Population proportion: p = X/N 

“+ Sample proportion: p = x/n 

* Meanof p: yu, =p 

¢ Standard deviation of p when n/N<0.05: oO, = J pq/n 
% zvalue for p 


A 


fe. 


a) 


Chapter 5: Estimation 


Estimation of the Mean and Proportion 


“+ Margin of error for the point estimation of u: +1.966, or +1.965; 


where o. =o/Vn and s, =s/Vn 
“+ Confidence interval for u for a large sample 
X +20, if ois known 


X +zs- if ois not known 


: 
where 6, =o/Vn and Sz =s/Vn 
“* Confidence interval for t for a small sample 


xtts, where s; =s/Vn 


>, 
“ 


Margin of error for the point estimation of p 


+1.9655 where s, = /pgin 

“* Confidence interval for p for a large sample 
p2zs, where s, = Jpqin 

“ Maximum error of the estimate for 
E=z0; or 25, 


“> Determining sample size for estimating p: n = 2° 0?/E? 
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o, 


“* Maximum error of the estimate for P 


E=zs;, where Sp = [pGin 


* Determining sample size for estimating p: n = 2 pqlE? 
Chapter 6: Hypothesis Testing 


Confidence Intervals for One Population Mean 


+ Standardised version of the variable X 


“ z-interval for | (6 known, normal population or large sample) 


elses 10} 
xt Za/2* = 


vn 


“ Margin of error for the estimate of u: E = 


oO 
Se 


“+ Sample size for estimating u 


4, Oo : 
n=( a/2° 
E 


rounded up to the nearest whole number. 


¢ Studentised version of the variable x 


pom 
stVn 


¢¢ t-interval for (6 unknown, normal population or large sample) 
Ettyy2.—= with d 1 
aay ole 
a/2 ae with df=n 
Hypothesis Tests for One Population Mean 


¢ z-test statistic for H,: 1 = U, (6 known, normal population or large sample) 


ae X — Uo 
o/Vn 


¢ t-test statistic for H,: u = [1 (6 unknown, normal population or large sample) 


a 
s/Nn i. , 
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Inferences for Two Population Means 


o, 


“+ Pooled sample standard deviation 


(n, —1)s? + (ny -1)s5 
S — 
4 eh Ny — 2 


Pooled f-test statistic for H,: U, = UW, (independent samples, normal populations or large samples, 
and equal population standard deviations) 


>, 
~ 


- 4 


aes 


+ Pooled t-interval for 11, — ., (independent samples, normal populations or large samples, and equal 
population standard deviations) 


(X — X2) tty2°SpYC/m)+(/m) with df=n, +n, —2. 

“+ Degrees of freedom for non-pooled t-procedures 
ee [(s; (i )+ (s3 /n, Ne 

Ge /n, ye i Gs /n, i 


nl nN, -1 


rounded down to the nearest integer. 


“* Non-pooled t-test statistic for H,: u, = 4, (independent samples, and normal populations or large 
samples) 


x —%) 

o <a See es 

(sy /1,) + (52 /ny) 

¢* Non-pooled rinterval for 1, — , (independent samples, and normal populations or large samples): 


(%, -X,) £t,)>'V(s2 /my) + (s2/n,) with df =A. 


¢* Paired test statistic for H,: U, = UL, (paired sample, and normal differences or large sample) 


— 


d 
t =——= withdf=n-1. 


syivn 


“* Paired ¢-interval for 1, — 4, (paired sample, and normal differences or large sample) 
rf ae 
d Faia with df=n-—1. 

n 


Inferences for Population Standard Deviations 
“ x°-test statistic for H,: 6 = 6, (normal population) 


7 id ee 

x =—S with df=n-1. 
9 

“> ’-interval for a (normal population) 
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F-test statistic for H,: 6, = 0, (independent samples and normal populations) 
F = sj /s} withdf=(n,-1,n,-1). 


Inferences for Population Proportions 
“+ Sample proportion 
ee: 
io 
n 
where x denotes the number of members in the sample that have the specified attribute. 


“> z-interval for p 
Pt Za). P(l- p)/n 


(Assumption: both x and n — x are 5 or greater) 
“> Margin of error for the estimate of p 


1B Zao PU p)/n 
“+ Sample size for estimating p 
2 mae? 
n= 025{ “a2 or n= p,(1- po 22) 


rounded up to the nearest whole number (g = “educated guess”) 
“* z-test statistic for H,: p = p, 
| SET ates 
Po(l— Po)/n 


(Assumption: both np, and n(1 — p,) are 5 or greater) 


X + Xo 
nm +N 


2 
“ 


Pooled sample proportion: p, = 


> 
~ 


z-test statistic for H,: p, =p, 
a= Sa 
Jb, (- Bb, )J(/m) + U/ ny) 


(Assumptions: independent samples; x,, n,—,, X,, 1, — x, are all 5 or greater) 
“ z-interval for p, — p, 


(Bi — Po) £ Zaj2-f PiU Py) /m + Po (— p)/m 
(Assumptions: independent samples; x,, n, — x,, X,, ", — x, are all 5 or greater) 
“ Margin of error for the estimate of p, — p, 


E = Zy jaf Pi By) /m + Po — P2)/m 


“ Sample size for estimating p, — p, 


=h= 0.5 Sol? 
nN Ny E 
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or 


2 
in “ a a Za/2 
m =m = (By C~ Py )+ Pag t~Prod| ms 


rounded up to the nearest whole number (g = “educated guess”) 


Chapter 7: Curve Fitting, Regression and Correlation 
“ Simple linear regression module for population 
y=A+Bxt+e 
“+ Estimated simple linear regression model 


y=a+bx 
* S _,S_, and a 
SS, = &(x; —¥)° = Ix? -(x,)? /n 
SS,, = 2(x, —¥)(y; — ¥) = Lx, y, — Sx; Ly, /n 
SS,, = L(y, — 9)” = Ey? — (By)? In 


“ Regression equation: ) =a+bx 


i eee al 
where b =——,, a= y-bx =—(2y, —b2x;) 
n 


“* Total sum of squares: SST = X(y,- y P = SS, 
“ Regression sum of squares: SSR = X( j,- y P = SS fe ESS: 


¢ Error sum of squares: SSE = X(y,- j,)? = SS,,— SS 2 /SS,,. 


“+ Regression identity: SST = SSR + SSE 
or SSR = SST-—SSE 


% Coefficient of determination: 7 poh ms 
oS oetficient of determination: ~ = SST SS, 
+ Linear correlation coefficient 
le Sa; = Ny SS; 
= Se or p= 


S,Sy " [SS SS yy 


Inferential Methods in Regression and Correlation 
“* Population regression equation: y = a + bx 
¢* Standard error of the estimate 


[SSE 
5, = 
n—-2 
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“* Test statistic for a test of hypothesis about B 


b-B 
(ge! 
Sp 
where S, = ge 
SS x 


[SS,, —b SS, 
S. = ——— ———_— 
‘ n-2 


* Confidence interval for B, 


Dicbtaio: 


Se 
JSS. with df=n-2 


“* Confidence interval for the conditional mean of the response variable corresponding to x 
4 


p | x, —2x,/ny _, 
Whigs 1, Sea with df=n—2 


“* Confidence interval for by. 


YES, 


va peer ee 
" ass. 


Prediction interval for ve 


2, 
~ 


JIS, 


1 (X%) -%) 
h Sy =S,,{1+—+——— 
Ca %p \ n SS. 


¢» Prediction interval for an observed value of the response variable corresponding to x, 


: | Go x, / eee 
Vp tlys2 Se yt Se . with df=n—2 


xx 


“ Test statistic for H,: p = 0 


(oS with df=n—2 
l-r- 


n-2 
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“> Test statistic for a correlation test for normality 
R = 2Xx;W; 


ss 


where x and w denote observations of the variable and the corresponding normal scores, 


respectively. 


Chapter 8: chi-square Tests 
“> Expected frequency for a category for a goodness-of-fit test 
E=np 
“+ Degrees of freedom for a goodness-of-fit test 
df=k—1 
where k is the number of categories. 
“ Expected frequency for a cell for an independence or homogeneity test 


E= (row total)(column total) 
Nl 
“+ Degrees of freedom for a test of independence or homogeneity 
df=(R—1)(C-1) 
where R and C are the total number of rows and columns, respectively, in the contingency table. 
“+ Test statistic for a goodness-of-fit test and a test of independence or homogeneity 


Dy One 
Der 
“+ Confidence interval for the population variance 6” 


(n-1)s? (n—1)s” 
ere CO anaeee 
Xa/2 X1-a/2 


Test statistic for a test of hypothesis about 6” 


K? 
~ 


2_(n-Ds* 
= 
Chapter 9 : Analysis of Variance 
Let k = the number of different samples (or treatments) 
n, = the size of sample i 
T, = the sum of the values in sample i 
n= the number of values in all samples =n, + n, + i 
Xx = the sum of the values in all samples = T, + fie a Oe es 
Xr’ = the sum of the squares of values in all samples 
“* For the F-distribution 
Degrees of freedom for the numerator = k — 1 


Degrees of freedom for the denominator = n — k 
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“* Between-samples sum of squares 


ys eh 2 \ 9 
ssp-[ TE... > (e253) 


ny Ho nz A 


¢ Within-samples sum of squares 


2 2 2 
eee (EEE, 
: m Mm 


o, 
Xd 


Total sum of squares 


2 
SST =SSB+Ssw=se- 2)” 
n 


“» Variance between samples 


MSB = BEB 
al 
“> Variance within samples 
sw = = 
is 
“> Test statistic for a one-way ANOVA test 
ee 
MSW 


O09 


Answers to Selected Problems 


Table 1.1 


10/30= 03352 


9/30 = 0.3 
11/30 = 0.3667 
1.0 
(c) 33% 
(d) 710% 
(e) See Fig. 1.1. 
1) 3 
oO 10 
5 10 2 
=) 
o 
i“ 5 
0 
X ny 4 Z 
Category 


Fig. 1.1: Bar graph for the frequency distribution 
{2 (a) and (b) See Table 1.2. 


Table 1.2 


6/15 =0.4 
3/15 = 0.3333 


4/15 = 0.2667 
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(c) 40% of the faculty members are in favour of this issue. 
13 (d) 


14 


hs 
NO 


Frequency 


(Sy (ee) ops de) 


0 il 2 3 4 5 
Vehicles owned 


Fig. 1.4: Bar graph vehicles owned 


1.5 (a) and (c) —see Table 1.5 
Table 1.5 


Percentage 


24 
30.5 to less than 43.5 : 36 
43.5 to less than 56.5 28 

12 
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(b) Yes, each class has a width of 13. 
(d) 24+36=60% of the employees is 43 years old or younger. 
1.6 (a) and (b) See Table 1.6 


Table 1.6 


Frequency | Relative frequency | Percentage 


Number of machines 
manufactured 
Dile23 
24-26 
27-29 

30-32 
33-35 


(co) See RiewG 


mM NH WwW 
on Oo 


Percentage 
a 


21-23 24-26 27-29 30-32 33-35 
Number of machines manufactured 


Fig. 1.6 


(d) For 26.67% of the days the number of machines manufactured is in the interval 27 to 29. 
1.7 (a) and (b) 


Table 1.7 


Relative frequenc 


F i 7/15=0.467 | 46.7 
A 5 5/15=0.333 | 333 
AS 3 3/15 =0.2 


(c) 46.7% of the faculty in this sample is full professors. 
(2) Seekig. 1.7. 


Fig. 1.7 
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18 (a) and (b): The frequency distribution are given in Table 1.8. 


Table 1.8 


(c) 34.6% of the elements in this sample belong to category 2. 
(d) 30.8% + 34.6% = 65.4% of the elements belong to categories 1 and 3 in this sample. 
lee (a) and (b): The frequency distribution, relative frequencies and percentage for all classes are 
shown in Table 1.9. 


Table 1.9 


Number of oil changes | Frequency | Relative frequency | Percentage 
21-23 5 


24-26 ) 
27-29 2, 
30-32 6 


33-35 


(c) The histogram and the polygon for the percentage distribution are shown in Fig.1.9. 


Percentage 


21-23 24-26 27-29 30-32 33-35 
Number of oil changes 


Fig. 1.9: Histogram and polygon for the percentage distribution 


(d) 20% of the days the oil changes were made in the interval 30-32. 
1.10 (a) The cumulative frequency, cumulative relative frequency and cumulative percentage 
distribution table are presented in Table 1.10. 


Table 1.10 


Collision | Cumulative | Cumulative relative | Cumulative 


claims $ percentage 
1-1000 a2 
1-2000 62.5 
1-3000 a5 
1-4000 91.7 
1-5000 100 
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(b) The ogive for the cumulative percentage distribution is shown in Fig.1.10. 
(c) From Fig.1.10, the percentage of claimants in the sample who received $2500 or less is 
approximately 78%. 


91.7 100 
400 87.5 


80 18 


Cumulative percentage 
>) 
oO 


0 1000 2000 3000 4000 5000 
Collision claim in dollars 


Fig. 1.10: Ogive for the cumulative percentage distribution 
eg (a) and (b): Refer to the Table 1.11 given below: 
Table 1.11 


Automobile repair costs | Frequency | Relative | Percentage 
(dollars) frequenc 


1—1500 40 
1501-3000 36 


3001-4500 
4501-6000 : 20 


(ec) See Fig. 1.11, 


Relative Frequency 


1—1500 1501-3000 3001-4500 4501-6000 
Collision repair costs (dollars) 


Fig. 1.11: Relative frequency versus collision repair costs (dollars) 


(d) The class boundaries of the fourth class are $4500.50 and $6000.50. The width of the class is 
$1500. 


la (a) and (b): See Table 1.12. 
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Table 1.12 


Number of 
unemployed adults 


l 
1~70 


1.13 (a) and (b): See Table 1.13. 


Annual compensation 
thousands of dollars 
1 to less than 1,000 
1,000 to less than 2,000 


2,000 to less than 3,000 27.6 

3,000 to less than 4,000 3.4 

4,000 to less than 5,000 10.4 
100 


1.14 (a) and (0): See Table 1.14. 
Number of children | Frequency 
less than 18 years age 


0 9 
1 8 
2 11 
3 Z 


a a a ae 


(c) 11+2=13 families have 2 or 3 children less than 18 years of age. 
(d) See Fig. 1.14. 


15 
11 
@ 10 2 8 
(= 
oO 
= 
[SE 
© 
Le 
5 
2 
0 
0 1 2 3 


Number of children 


Fig. 1.14 
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1.15 Refer to the solution of problem 1.5. 
(a) and (b): See Table 1.15. 


Table 1.15 
Age (years) | Cumulative frequency Cumulative Cumulative 
relative frequenc 
18 to 30 12 
18 to 43 30 
18 to 56 44 
18 to 69 50 


(c) 100—60=40% of the employees are 44 years of age or older. 
(aj) See Fie. 1.5, 
(e) About 52% of the employees are 40 years old or younger. 


a 120 
© 
i= 100 100 
O 
= 80 88 
Qa 
g 60 4 52 60 
® 40 
= 
E 20 
oO 

0 

0 20 40 60 80 

Age (years) 
Fig. 1.15 


1.16 Refer to the solution of problem 1.6. 


Table 1.16 


Li 


Number of machines 
manufactured 

1 to less than 1,000 

1,000 to less than 2,000 

2,000 to less than 3,000 

3,000 to less than 4,000 

4,000 to less than 5,000 
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1.18 7 % B 7 1 es 
8 192 81 8 81 = 
2) | 3% Ss 9 8% «95 TI 
10 |42 &6& 6 10 |42 6 #4 
10 6-28 lt 12 7 
Stem and leaf display Stem and leaf display in increasing order 

1.19 Oo) ilS way ES Ti 7 
1 Oo 2 a 3. 9 0 Z oy ane a 
a | 3. UT 6 9 2 2 Va 2 6 as 
Sema 7 2 Seciliee 4 a 
a Vit» 3 4 |3 u 
5 10 See 
6 |4 : 6 14 
Stem and leaf display Stem and leaf display in increasing order 

1.20 


Excellent 
20% 


1.21 


486 // Probability and Statistics for Scientists and Engineers I 


p22 
Leaves 
4 
5 
4 
| 
5 
0 
I 
A 
123 
Leaves 
1.24 
[Stem[ ——sLeaves” 
28 
86 | 50 | 40 
E25 (a) 


(b) 


O-1 | 89) * | 73 09 
2-3 | 81 | 87 | 35 87 
4-6 | 73°12 | 5307) 74 


Stem-and-leaf display by group 


78 17 89 75 
* 117179 | 55 | 54] 57 | 88 
54 | 89 | 84114] 69 | * | 89 | * 


RRP™ 
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1.26 Mean = 5.6 hours, median = 6 and mode 0 and 6 
127. Mean = 25.28, median = 25.8 and mode = 24 
1.28 Mean = 78.42, median = 77.0 and no mode 
1.29 Mean = 5.70, median = 5.7 and mode = 5.7 
1.30 Mean = 27.5, median = 27.5 and mode = 23 
131 Range = 11, variance = 10.788 and standard deviation = 3.28 
1.32 Range = 13, variance = 19.822 and standard deviation = 4.452 
133 Range = 26, variance = 72.278 and standard deviation = 8.50 
1.34 Range = 12, mean = 9.4, variance = 11.4 and standard deviation = 3.3764 
35 Variance = 113.3444 and standard deviation = 10.6463 
1.36 Range = 18, variance = 39.5667 and standard deviation = 6.2902 
137 Standard deviation: Data set 1 = 14.64 and Data set 2 = 14.64 
1.38 Mean = 67.5, variance = $2,432.20 and standard deviation = $49.32 
1.39 Mean = 9.5897, variance = 39.7256 and standard deviation = 6.3028 
1.40 Mean = 28.4, variance = 210.6531 and standard deviation = 14.5139 
1.41 Mean = 3.4, variance = 0.8990 and standard deviation = 0.9482 
1.42 Mean = 24.65, variance = 122.7462 and standard deviation = 11.08 
1.43 Mean = 1.7875, variance = 1.0049 and standard deviation = 1 
1.44 (a) Q,=69.5, O, = 76.5, O, = 85.5, IOR = 16. The value of 80 lies between Q, and Q.. 
(Ga 90 
(c) _ percentile rank of 81 = 66.67% 
1.45 (a) Q=35,2,=40, 0, =43,JOR=8 
(b) P,.=45 
(c) Percentile rank of 39 = 36.67% 
1.46 (a0 = 24,0 —2).9,0,=31,JOR=7 
(b) Less than 29 for approximately 65% of the days. 
(c) 76.67% 
1.47 (a) Q=69, QO, =78, QO, = 84, JOR = 15; 81 lies between Q, and Q, 
(Op Pao, 
(c) 57.89% of the scores were below 82 
1.48 (a) O=73.5,0 49,0. 80) 
(b) IOR=65 
(c) Min=46,0=73.5, O, =79, Q, = 80 and Max = 83 
149 (a) O=4,0=7,0,=12 
(b) IOR=8 
(c) Min=1,0=4, O,=7, Q, = 12 and Max = 25 
1.50 (amo 105.07 1,5, and 0, = 86.5 
(a) 10R—16 
1.51 (ja 5020,—3/,and'0,=49 
(b) IOR=185 
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152 ao 
1.53 P= 66.67% 
1.54 QO = 23, QO, = 30.5, and @, = 36.5 hours. 
155 (a) Min=6, Q=23, Q, = 36.5, Max = 64 hours. 
(b) IQR= 13.5 hours. 
(c) Lower limit = 2.75 hours; Upper limit = 56.75 hours. 
(d) 64 isa potential outlier. 
Ey, (a) 0-88, C =1315 and 0 = 154 
(b) IQR=66 
(c) No potential outliers 
(d) 
88 131.5 154 


=th| (= 253 


—11 88 lsiko 154 
Fig. 1.57: Box plot 
1.58 Skewed to the left. 
1.59 


Box-and-whisker plot 
1.6] The data are nearly symmetric. 


2A (a) needs engine-oil change, tune-up and new tires 
(b) needs tune-up, new tires but no engine-oil change 
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(c) needs engine-oil change but neither tune-up nor new tires 

(d) need engine-oil change and new tires 

(e) need tune-up, but no new tires 

(f) does not need oil change. 
oe (a) 5 

(b) 1 and 2 together 

(c) 3,5 and 6 together 

(d) 1,3, 4 and 6 together 
pes (a) 


(b) Mhas 170 systems, S has 50 systems 
24 (a) P(S)=0.56 
(b) S=(AorBorC) 
(c) P(A)=0.01, P(B)=0.14, P(C)=0.41 
(d) P(S)=0.56 
25 (a) 093 
(b) 0.56 
2.6 (a) P(F)=0.50; P(D) = 0.10; PF and D) = 0.05 
(b) PF orD)=0.55 


(c) P(M)=0.50 
27 0.96875 
28 S= {MM, MF, FM, FF} 


29 (a) FF 
(b) {FF, FA, AF} (c) {FA, AF} 


2.10 0.3334 
Pl 0.1667 
212 (a) 18 


(b) Sample space and sample points 
ite? 13,14, 1 Syh6, 
Ieoe2.2 3,24,25,20)2 
13253 9.3'4; 35,3 6,3 
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Z3 
2.14 


2.15 
2.16 


ZF 
2.18 
2.19 
2.20 
2.21 


(c) 
(a) 
(a) 
(b) 
(c) 


(a) 
(b) 
(a) 


(a) 
(b) 
(c) 


1,4 2,43,44,45,46,4 
135-295.5 4,5 55,65 
16 2,6 3,6 4,6 5,6 6,6 
Probability = 1/36 


(i) 3/8 (ii) 12 

(i) 1/6 (ii) 1/12 (b) 223 
0.2125 

0.0962 


21.25% of the employees are in their 50’s; 9.62% of the rank R, employees are in their 50’s 
P(R, and A,) = 0.2655; P(R,) = 0.2706; P(4,/R,) = 0.0962 

P(M/S,) = 0.0888 

P(S,/M,) = 0.6666 

P(M,/F,) = 0.6122; P(F, and M,) = 0.2449 

0.0316 

0.075 

0.9688 

0.001 

0.999 

The intersection of the events A, B and C is denoted as ABC in Fig. 2.21. 


Final outcomes 


Third patient 
P(ABC) = 0.001 


Second patient 


P(ABC) = 0.09 


P(ABC) = 0.09 
First patient 


P(ABC) = 0.081 
Start ( ) 


P(ABC) = 0.09 


P(ABC) = 0.081 
P(ABC) = 0.081 


P(ABC) = 0.729 


Fig. 2.21: Tree diagram 


// Answers to Selected Problems // 491 
OO svers to Selected Problems | 491 


222 S={MMM, MMW, MWM, MWW, WMM, WMW, WWM, WWW} 
225 P(woman) = 0.4; P(man) = 0.6; Sum of the probabilities = | 
2.24 (a) See Table 2.24 (Frequency distribution). 
Table 2.24 
Number of design engineers | F requency | Relative frequency | 
0 35 0.035 
1 210 0.210 
2 379 0.379 
3) 0.270 
4 or more 0.106 4 
1.0 
(b) 0.379 
(c) 0.106 


225 (a) not independent, 
(b) not mutually exclusive 
2.26 (a) 3/13 
(b) 5/8 
PAPA (aja0.21 
(b) 0.24 
(c) not independent 
2.28 (a) 23/40 
2.29 (a) 0.156 


(b) 0.1265 
230 (a) (0.025 (ii) 0.183 (b) 0 
231 0.2651 
232 0.6667 
233 (a) 05428 

(b) 0.8437 
2.34 0.9516 


235 (a) mutually exclusive 
(b) not mutually exclusive 


2:36 P( B ) = 0.003; P(B) = 0.097; P(C) = 0.0941; P(D) = 0.9941: P( D ) = 0.0059; P( C ) = 0.0029 
238 (a) 58 
(b) OA 
239 0.486 
2.40 0.9882 
241 50% 


242 (a) 0.029 
(b) 0.3103 
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243 
2.44 
2.45 


2.46 
2.47 


2.48 


2.49 


2.50 
2.51 


252 


255 


2.54 
2.55 


2.56 


(a) 
(b) 


(a) 
(b) 
(c) 
(d) 
(a) 
(b) 


(a) 
(b) 
(c) 


(a) 
(b) 


(a) 
(b) 
(a) 
(b) 


7/10 
6/100 
S= {FF, FA, AF, AA} 
0.84 
0.135 
0.04 
0.96 
0.5 

0.33 

1/3 
P(A) = 5/9, P(B/A) = 1/2, P(AB) = 5/18 
0.4437 
0.99992 
0.4545 
0.2424 
0.3030 
0.3182 
0.25 
0.36 
0.7308 
13 

2/3 
0.3025 


A tree diagram showing the various probabilities along with the probabilities as branches is 


shown in Fig. 2.56. 
P(A/B,) = 0.15 


- US 
P(B,) = 0.55 P(not A/B,) = 0.85 


P(A/B,) = 0.4 
P(B,) = 0.25 
Start (B.) 
Europe 


P(not A/B,) = 0.6 


P(B,) = 0.2 P(A/B,) = 0.6 


Japan 


P(not A/B,) = 0.4 


Compact (0.55)(0.15) = 0.0825 


Compact (0.25)(0.4) = 0.10 


Compact (0.2)(0.6) = 0.12 


Fig. 2.56: Tree diagram 
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(c) 0.33058 
257 (a) 0.595 
(b) 0.605 
(c) 0.0987 
258 (a) 065 
(b) 0.23077 


(c) Here, we like to find P(d,/A), by applying the Baye’s formula. 


P(Ald;)-P(d;) _ (0.6)(0. 
P(d,/A) == = Dae = 0.23077 


P(A/d,) = 0.7 


P(d,) = 0.25 


Door 2 
P(not A/d,) 


Sta 
P(A/d,) = 0.6 


Door 3 
P(not A/d,) 


P(d,) = 0.25 P(A/d,) = 0.5 


P(not A/d,) 


Fig. 2.58: Tree diagram 


259 0.2384 
2.60 0.2857 


Escape (0.7)(0.25) = 0.175 


Escape (0.8)(0.25) = 0.20 


Escape (0.6)(0.25) = 0.150 


Escape (0.5)(0.25) = 0.125 


CHAPTER —- 3 
Syl (a) Table 3.1 gives the probability distribution. 


Table 3.1: Probability distribution 


11/90 = 0.12 
20/90'— 0:22 


23/90 = 0.25 
22/90 = 0.24 
14/90 = 0.15 


(b) approximate (c) (i) 0.22, (ii) 0.86, 


(iii) 0.59, (iv) 034 
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32 (a) 
(b) 
(c) 
(d) 

33 

34 (a) 
(b) 

a5 (a) 


0.12 
041 
0.43 
0.36 
u= 3.25; o = 1.26 
w= 1.30 
P= 151;0=1.23 


Table 3.5(a) 


Figure 3.5: shows the line graph of the probability distribution of Table 3.5(a). 


(b) 
3.6 (a) 
(b) 
(c) 
(d) 
(e) 
(f) 
(g) 
a0 (a) 
(b) 
(c) 


P(x) 
0.40 


0.30 


0.20 


0.10 


Fig. 3.5 


(i) 0.35, (ii) 0.70, (iii) 0.65, (iv) 035 
Discrete random variable 


Continuous random variable 
Discrete random variable 
Continuous random variable 
Continuous random variable 
Discrete random variable 
Discrete random variable 
Yes 

0.5 

0.45 
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3.8 6o=1.20 


3.9 u= 12.5; 0= 1.36 
3.10 p= 3.5; o? = 2.92 
Sl 0.39347 

512 900 hours 

3.13 


(3 4 6 


Fig. 3.13(a): Density function 


Fig. 3.13(b): Distribution function 


3.14 (a) 0.5941 


(b) 0.1992 
a5 (a) Vo 

(b) 39/64 

(c) 1/16 
3.16 0.0819 
o17 p= 0.4167; 0” = 0.0645 
3.18 (a) p=(at b)/2; 0? =(b-ay/12 
3.19 xX,, = 6.931 sec. 


3.20 =A; variance = A? 
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iva (a) 26! 

(b) 720 
3.22 315 
Be, 


Table 3.23: Number of possible acronyms with n letters 


Number of possibilities 
without repetition 


n | Number of possibilities 
with repetition 


676 650 

17.576 15,600 

. 456,976 358,800 
11,881,376 7,893,600 


3.24 (a) 90,000 
(b) 27,216 
(c) The first digit can be chosen in 9 ways, the second in 8 ways, the third in 7 ways and the 
fourth in 6 ways. Then 9.8.7.6 = 3,024 numbers can be formed. 


325 3003 

3.26 (a) 04118 
(b) 0.068 
(c) 0.0229 
(d) 0.7941 

327 120 

3.28 12 

3.29 24 

3.30 7/10 

331 (a) 0.57664 
(b) 0.35161 


(e472. 107 
sey) (a) 462 

(b) 462 
3:33 (a) 2520 

(b) 1400 

(c) 504 


3.35 (a) 350 
(b) 150 
(c) 105 
3.36 (a) 140,608 
(b) 132,600 
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337 56 
3.38 o 
3.39 6 
3.40 10 


341 (a) P(X>1)=0.9576 
(b) P(X>1)=0.5429 


3.42 0.44 
3.43 P(1) = 0.347; P(2) = 0.069; P(3) = 0.005; P(0) = 0.579 
3.44 0.2759 
3.45 (a) n=S0and p=0.1 
(f)) On? 
(Cc) easiex 10-* 
3.46 P(return) = 0.19 
3.47 0.0086 
3.48 (a) 0.228 
(b) 0.9655 
3.49 (a) 0.00356 
(b) 0.0444 
3.50 Oai22 
551 (a) 04575 
(b) 0.082 
3.92 0.423 
B53 (a) 0.1755 
(b) 0.0858 
(c) 0.2873 
3.54 (a) 0.2240 
(b) 0.1992 
5:55 P(X $ 3) = 0.433 (using Poisson distribution with A = 4) 
3.56 0.453 
Sei) (a) 0.0041 
(b) 0.0467 
(c) 0.2765 
(d) 0.8203 
3.58 (a) 04096 
(b) 04096 
(c) 0.1536 


359 (a) 018 
(b) 0323 
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3.60 


3.61 
3.62 
3.63 
3.64 
305 
3.66 


3.67 


3.68 


3.69 


3.70 


371 
3.72 


a0 
3.74 
ee) 
3.76 
Bee 
3.78 
em) 
3.80 


(a) 
(b) 


(a) 
(b) 
(c) 
(a) 
(b) 
(c) 
(a) 
(b) 
(c) 
(d) 
(a) 
(b) 
(c) 
(d) 
(e) 
(a) 
(b) 
(c) 
(d) 
(e) 


(a) 
(b) 
(c) 


(a) 
(b) 


2000 C,(0.001)°(0.999)'”” 

1 — [2000 C,(0.001)°(0.999 + 2000 C,(0.001)'(0.999)"” + 2000 C,(0.001 °(0.949)'98] 
0.228 

21.12% 

11.48% 

0.9772 

0.8185 

0.0668 1 

0.86638 

o = 0.000214 

0.8849 

0.1151 

0.7262 

0.6772 

0.6772 

0.0668 

0.2560 

0.99865 

0.98422 

0.9973 

0.34134 

0.00135 

0.5 

P(X <z)=0.10 and z =—1.28 

z=2.58 

z=1.0 

P(Z <z)=0.8 + 0.10749 = 0.90749 and z = 1.33. 
99.73% 

0.8849 

0.1151 

0.7262 

0.8415 

x=9.771 

P(Xs 1) = 0.923 (Hypergeometric); P(X < 1) = 0.914 (Binomial) 
P(X $3) = 0.429; P(X < 3) = 0.433 (Poisson) 
P(2<X < 14)=0.3716 

0.7748 

0.755 

0.1937 

0.1839 
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3.81 0.7717 (Hypergeometric); 0.7763 (Binomial) 
3.82 P(X=7 or 8)= 0.1611 
3.83 (a) 0.7286 
(b) 0.6318 
3.84 0.1131 
3.85 (a) 0.0484 
(b) 0.0192 
(c) 0.1525 
3.86 P(X S$ 491.1 or X= 50.9) < 1/4 
3.87 88.8% 
3.89 75% 
3.90 (a) 75% 
(b) 84% 
(c) 89% 
3.91 (a) b=26.83 
(b) o=1.58 
3.92 95% 
Bee 0.0062 
3.94 0.0136 
3.95 0.7698 
3.96 1/150 
Se ol 0.0808 
3.98 Mean = 3.02; Standard deviation = 0.042 


41 (a) 


Table 4.1: Sampling distribution of x 


Probability 


(b) The mean is p = 8/3. | 3 
(c) The mean of the sampling distribution of proportion is 8/3, and the population mean is also 
8/3. In general, the distribution of sample means will have a mean equal to the population 
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mean. The sample means tend to target the population mean instead of systematically being 


43 


44 


45 


46 


47 
48 


49 


4.10 


4.11 
4.12 


The distribution of X and _Y are shown in Fig. 4.3. 


(a) 


(b) 


(a) 
(b) 
(a) 
(b) 


too low or too high. 


P(X <95) = P(z < -2.5) = 0.006210 


4 5 


Fig. 4.3 The distribution of x and x 


Hy =h=60kg OF = 


u- ="t=60kg o, = 


0.96407 
0.86218 
67 
—2.67 


Approximately 83.32% of the samples will have the mean between 588 and 605 hours. 


(a) 
(b) 


(a) 


(b) 
(a) 
(b) 
(c) 
(d) 


(a) 


(b) 


0.21644 
0.00154 


= 55 


is N(O, 1) 


N 


0.9545 
0.01224 
0.88814 
0.78870 
0.006210 
0.11507 
0.02 mm 


Age 
0.02 
(i) 0.30854 (ii) 0.158655 


is N(0, 1) 


“- 


(iii) 0.53281 


4.13 


4.14 


4.15 


4.16 


4.17 


4.18 


4.19 


4.20 
421 


422 
4.23 
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(a) 0.039204 
(b) 0.710856 
(c) 0.173609 
There is a 96.25% probability that the estimate will be within 6 months of the true mean. 

100 
(a) oO; =—=—=10 

100 

(b) The probability is approximately 98.76% that the sampling error made in estimating the 


(a) 


(b) 


(a) 
(b) 


(c) 


(d) 


(e) 


(a) 


(b) 


(c) 


population means length of time employed by all men with one job by the mean length of 
time employed of a sample of 100 such men will be at most 25 weeks. 


oO 16 
= 80 and o, = —= =——= 
Re 
z=——— ig approximately (0, 1) 
(i) 0.02275 (ii) 0.00135 (iii) 0.5 


If all possible sample means for sample size 81 were found, their distribution would be 
approximately normal with mean $45,000 and standard deviation $1000, 

If all possible sample mean for samples of size 289 were found, their distribution would be 
approximately normal with mean $45,000 and standard deviation $529.42. 


The answer is no because these results follow from the central limit theorem (Chapter 3, 
Section 3.8), which implies that for large samples (n > 30), the distribution of the sample mean 
will be approximately normal regardless of the distribution of the original variable. 


68.27% of samples of size 81 will have a mean that is within $1000 of the population mean, 
1.€., 68.27% of the samples will have a sampling error less than $1000 for samples of size 81. 
94.12% of samples of size 289 will have a mean that is within $1000 of the population mean, 
i.e., 94.12% of the samples will have a sampling error less than $1000 for samples of size 289. 


0.621% 


S, === =u = 0.9 

“Vn 100 
The answer is it does not invalidate the result found in part (a). The sample size of 100 is 
large enough so that the central limit theorem (Chapter 3, Section 3.8) applies and the sampling 
distribution will be approximately normal. 
0.99913 
0.09428 
np = 49(0.20) = 9.8 and ng = 49(0.8) = 39.2. Since np and ng are both greater than 5, the 
sampling distribution of p is approximately normal. 


0.0914 
0.9463 
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dhol tA INA E AE Aah at et he Cth AAA AN fae Ny td tlt 


5.1 3514.67 to 3617.33 
Sp 49,000 to 50,196 
53 (a) n=25 
(oye) n= 35 (ii) n=49 
54 20.45 <p <21.55 
Se) 3.65 <u<435 
5.6 19.3156 <u< 20.6844 
Shi 70.06 << 75.94 
5.8 0.5089 <u < 0.5091 
Pe, = 11.76 
59 ¥-— sys X+—- 
5.10 53.458 <p < 58.342 
Sullil 19.2308 < uw < 32.1692 
SH 8.667 <u < 16.267 
Syl! (a) 18.646 to 83.175 
(b) 21.527 to 66.692 
(c) 23.239 to 59.886 
5.14 1.325<0? < 9.333 
5.15 oO” < 24.7875 
5.16 (a) 0.163 
(b) (i) 0.0635 <0? <0.9807 (ii) 0.2324<o0? <1.2129 
Sully 0.9674 <6 < 1.9327 
5.18 1.876 <0 $< 5.833 


5.19 (a) 0.121<p<0.179 
(b) 0.141 <p<0.159 
(c) 0.147<p<0.153 

5.20 0.3697 to 0.4303 

5.21 (a) 0.669145 to 0.73085 
(b) 0.66334 to 0.73665 
(c) 0.651754 to 0.74824 
(d) Yes 

nwa, (a) p =0.90; +0.0294 
(b) 0.8706 to 0.9294 

5.23 (a) 66.82% to 93.18% 


(b) lowering the confidence level and increasing the sample size 
5.24 0.57853 to 0.621471 
Do 12.16 to 17.84 years 
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5.26 5.2% to 24.8% 


5.27 Core |t = = 35.81 
5.28 2.593 <p, — MW, < 5.407 
pes) 0.60 to 2.4 policies 
5.30 —0.31 to 0.07 


a3 (a) ~1.923 Sp, —p, <0.9225 
(6b) Confidence interval is broadened 


5.32 46.282 <p, —u, $ 53.718 
533 146.2 to 253.8 g 
534 4,342 <p, —p, < 0.342 
5.35 0.72 Sp, —b, $6.72 
536 3.6759 <p, — pW, < 5.6759 
537 4.7428 to 15.314 
5.38 —18.866 to —2.634 
539 0.9396 to 18.4732 
540 ~7.2017 to -2.7983 
5Al 1.2493 to 3.4174 
542 —1.0332 to 5.5668 
Co 

543 0.1395 < oe 1.96 

Op 

o 
5.44 0.3244 <—> < 2.58285 

03 2 

o- 
545 0.07592 < oe 2.86224 
O> 
G; 
546 0.609375 < 21 3 < 4.453125 
2 

547 aas < SL < 4.05 


548 (a) 0309 <2 < 1.7475 
2 


co 
(b) 0.2565 << <2.115 
0» 


C2 
oO; 
(c) 0.6124 < a 


an 
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5.49 (a) -0.2334 

(b) 0.293582 <p, —p, S$—0.173218 
550 (a) 0.025 

(b) 0.031626 to 0.081626 


551 —0.065387 <p, —p, $ 0.109787 
500 Giz Sp > p,s04777 
555 0.122774 to 0.070752 
5.54 —0.113897 to -0.046103 
55 (a) n=865 

(b) n=385 
5.56 (a) n=96 

(b) n= 136 
Spoil No 
5.58 n= 384 
259 n= 32,617 
5.60 (a) n=999 

(b) n= 1040 
5.61 n= 1849 
562 n=2122 
5.63 (a) n=96,040 

(b) n=2167 
CHAPTER-6 
6.1 (a) +2.58 

(b) 2.33 

(Cees 
62 (a) +1.96 

(b) -1.645 

(c) 1.645 
63 (a) =28) 

(b) -2.58 

(c) 258 
6.4 (a) +3.08 

(b) 2.88 

(c) 2.88 
6.5 (a) The hypotheses that needed to tested: 

Hat 75 
Ngee te: 


(b) Do not reject H, 
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6.6 Reject H, 


6.7 N=139 
6.8 Sample mean cannot be taken as population means. 
6.9 (a) Yes; H, is accepted; man time can be taken as 58.2 minutes 
(b) Do not reject H, 
6.10 Reject H,; claim of the earlier study is not true. 
6.11 Reject H,; mean population time for the product is more than 45 minutes. 
6.12 (a) Do not reject H, 
(b) (i) n=65 (ii) n=81 
6.13 Reject H, 
6.14 Reject H,; mean number of machine 1 is higher than that by machine 2. 


6.15 Reject H,; the two brands of machine parts differ in their carbon content. 
6.16 (a) reject H, 
(b) do not reject H, 
(c) reject H, 
6.17 Reject H,; mean load at failure exceeds 10 MPa. 
6.18 Reject H,; mean baking temperature exceeds 90°C. 


6.19 Reject H,; mean weekly sales of salespersons increase as a result of attending that course. 
6.20 Do not reject H,; mean of the paired differences is not different from zero. 
6.21 Reject H,; use of gasoline additive increases the gasoline mileage. 


6.22 (aj 3.85 to 17.15 
(b) Reject H,; the mean number of items produced per hour by the first machine is higher than 
by the second machine. 
6.23 Reject H,. 
6.24 Reject H,. 
6.25 Reject H,. 
6.26 Reject H,. 


O27 Reject H,; persons with a business degree are better salespersons than those with a non-business 
degree. 
6.28 Do not reject H,; the mean GPAs of all male and female college students are not different. 


6.29 Reject H, and conclude that o is not equal to 0.25 at a = 0.05. 

6.30 Reject H,; the process used in the manufacture of parts must be adjusted. 

6.31 Do not reject H,. 

6.32 Do not reject H,; variance of current seniors is not different from 150. 

6.33 Reject H,; the population variance is greater than 0.003 mm. 

6.34 Do not reject H,; 6, is equal to 4000 hours. 

6.35 Do not reject H,; sufficient evidence to warrant that o # 10. 

6.36 Do not reject H.,, both production lines have the same variance from the predetermined length. 


6.37 (a) Do not reject H,; variances are equal. 
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7.1 
ile 
73 
74 


(b) Do not reject H,; no evidence to indicate that the new voltage-regulating device has affected 
the mean value. 

Do not reject H,; no strong evidence to indicate that either gasket material results in a smaller 

variance of gasket thickness. 

Accept H,; there is no significant difference between the variances. 

Reject H,. 

Do not reject H,. 

Do not reject H,; we cannot claim that the standard deviations of surface roughness for the two 

processes are different at the 90% confidence level. 

Reject H,. 

Sufficient evidence to warrant rejection of the claim, the proportion of 0.60. 

Reject H.. 

Do not reject H,. 


Do not reject H,; no evidence that the proportion of defective bolts produced by the machine is 
reduced. 

Reject H,. 

Do not reject H,; no evidence to indicate that the attitude this year is different from that of last 
year. 


Do not reject H,; no strong evidence to support the claim that the two polishing materials are 
different. 


Do not eject H,; the data provides insufficient evidence to state that the fraction defective differ 
at & = 0.05. 


(a) 0.10 

(b) 0.4382 to 0.15618 

(c) The non-rejection region lies to the left of z= 1.96. 
(d) p=0.74545; q=0.254545 

(e) Reject H, 

Do not reject H,. 

Accept H, 


Do not reject H,; no evidence to indicate that one brand of spray is significantly different from the 
other. 
Reject H,; there is sufficient evidence to conclude that the percentage of men who sometimes 


order vegetarian meals is smaller than the percentage of women who sometimes order vegetarian 
meals. 


a=-—22.0007, b = 3.7484 
=—149.3849, b=9.1956 

a= 0.4449, b = 0.2747 

a=—47.4598, b= 11.582 
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75 a=22.917, b=3.3883 


7.6 a= 44.3333, b=0.4167 

ad a= 80.7777, b= 1.138 

78 a= 39.3327, b=0.4213 

7) a= 1.1574, b=0.2566 

gAG a= 124.0862, b=—13.5402 


7A1 (a) a= -22.0007 and b = 3.7484 
(b) s,= 30.8722 
(c) SSE=7624.7219 
(d) SST=23973.6 
(e) SSR=16348.8781 
(ff) r=0.682 
(zg) r=0.8258 
when x = 42, y = 135.4307 
712 (a) a=—149.3849 and b = 9.1956 
(b) s,=58.8065 
(c) SSE= 13832.8076 
(d) SST=125520.8333 
(e) SSR=111688.0258 
ff) r=0.8898 
(gz) r=0.9433 
when x = 53, y = 337.9811 
7.13 (a) a=0.4449 and b= 0.2747 
(b) s-— 1.0550 
(c) SSE=5.5648 
(d) SST=66 
(e) SSR=60.4352 
G) r=0.9157 
(zg) r=0.9569 
when x = 42, y = 11.9825 
7.14 (a) a=-474598 and b= 11.582 
(b) 5s = 3.3805 
(c) SSE=249.1216 
(d) SST =47682.1 
(e) SSR=47432.9784 
(ff) 1=0.9948 
(g) r=0.9974 
when x = 20, y = 184.1804 
(ALB) (a) a=22.917 and b= 3.3883 
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7.16 


15%) 


7.18 


Zag 


(b) 
(c) 
(d) 
(e) 
f) 
(g) 


(a) 
(b) 
(c) 
(d) 
(e) 
ff) 
(g) 


(a) 
(b) 
(c) 
(d) 
(e) 
—) 
(g) 


(a) 
(b) 
(c) 
(d) 
(e) 
ff) 
(g) 


(a) 
(b) 
(c) 
(d) 
(e) 
(f) 
(g) 


5S =5:6669 

SSE = 256.9085 

SST = 4573.6 

SSR = 4316.6915 

r= 0.9438 

r=0.9715 

when x = 15, y= 73.7415 
a= 44.3333 and b=0.4167 
pee2.0973 

SSE = 1281.6667 

SST = 1802.5 

SSR = 520.8333 

r =0.2890 

r=0.5375 

when x = 82, y= 78.5 

a= 80.7777 and b= 1.138 
Sea 01 70 

SSE = 492.4669 

SST = 2500.6667 

SSR = 2008.1998 
r=0.8031 

r=0.8961 

when x = 45, y= 131.988 
a= 39.3327 and b = 0.4213 
5 1.3306 

SSE = 17.8917 

SST = 32.9167 

SSR = 15.0249 

P= 0.4565 

r= 0.6756 

when x = 72, y = 69.6634 
a= 1.1574 and b = 0.2556 
s, = 1.0438 

SSE = 5.4473 

SST = 58.2143 

SSR = 52.767 

r =0.9064 

r=0.9521 

when x = 41, y= 11.6778 
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7.20 


721 
U2 
723 
7.24 
25 
7.26 
14 
7.28 
ea) 
7.30 
731 
732 
(Be 
7.34 
TES) 
7.36 
fo7 
7.38 
Usk. 
7.40 
7Al 


742 


743 


744 


(a) 
(b) 
(c) 
(d) 
(e) 
) 
(g) 


(a) 
(b) 


(a) 
(b) 


(a) 
(b) 


(a) 
(b) 


a= 124.0862 and b =—13.5402 

S,= 15.4208 

SSE = 1426.8046 

SST = 9402 

SSR = 7975.1954 

P=0.85 

r=0.92 

when 7 = 53, y = 29.3046 

1.6614 to 5.8354 

4.7038 to 13.6874 

0.1789 to 0.3705 

10.8977 to 12.2663 

2.7144 to 4.0622 - 

—0.1162 to 0.9496 

0.7410 to 1.5351 

0.0974 to 0.7451 

0.1618 to 0.3514 

—19.2615 to —7.8189 

H,: B = 0; H,: B > 0; critical value: t = 2.896; test statistic: t= 4.1419; reject aE 

H,; B= 0; H,: B> 0; critical value: t = 3.747; test statistic: t= 5.683; reject fei 

Hi: B= 0; H,: B> 0; critical value: t = 3.365; test statistic: tf = 7.369; reject De 

H,: B= 0; H,: B> 0; critical value: ¢ = 2.896; test statistic: t= 39.0283; reject H,. 

H,; B= 0; H,: B> 0; critical value: t = 2.896; test statistic: t= 11.5939; reject H,. 

H,: B = 0; H,: B > 0; critical value: t = 2.896; test statistic: += 1.8030; do not reject H,. 

H,; B= 0; H,: B > 0; critical value: t = 2.764; test statistic: t= 6.3858; reject H,,. 

H,; B = 0; H,: B > 0; critical value: t = 2.764; test statistic: t= 2.8979; reject H,. 

H,: B= 0; H,: B > 0; critical value: t = 3.365; test statistic: f= 6.9595; reject H,. 

H,: B= 0; H,: B > 0; critical value: t= 3.143; test statistic: t= —5.7911; do not reject H,. 

2.0650 to 5.4317 

Bie 0 2 B= 0; critical value of tf = 1.86; test statistic; ¢ = 4.1419; do not reject null 
hypothesis and B is positive. 

5.7458 to 12.6453 

He Bae b= 0; critical value of t = 2.132; test statistic; f = 5.6829; do not reject null 
hypothesis and B is positive. 

0.1996 to 0.3498 

HB Oi B= 0; critical value of t = 2.015; test statistic; t = 7.3646; do not reject null 
hypothesis and B is positive. 

11.03 to 12.134 

i, Bi 0; ee A, critical value of ¢ = 1.86; test statistic; t = 39.0229; do not reject null 
hypothesis and B is positive. 
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745 


746 


747 


748 


749 


7.50 


Tol 
152 
128 
7.54 
SS 
156 
eu 
7.58 
ieee 
7.60 
7.61 
7.62 
7.63 
7.64 
7.65 
7.66 
7.67 


7.68 
7.69 
7.70 


(a) 
(b) 


(a) 
(b) 


(a) 
(b) 


(a) 
(b) 


(a) 
(b) 


(a) 
(b) 


2.8447 to 3.9319 

fe BB < 0; critical value of f = 1.86; test statistic; ¢ = 11.5958; do not reject null 
hypothesis and B is positive. 

—0.0132 to 0.8465 

eeee 0; i B= 0; critical value of ¢ = 1.86; test statistic; ¢ = 1.8031; do not reject null 
hypothesis and B is positive. 

0.8151 to 1.4609 

PieeB dah <0; critical value of ¢ = 1.812; test statistic; f = 6.3861; do not reject null 
hypothesis and B is positive. 

0.1579 to 0.6847 

H,: B = 0; H,: B < 0; critical value of ¢ = 1.812; test statistic; ¢ = 2.8975; do not reject null 
hypothesis and B is positive. 

0.1823 to 0.3309 

hee Oe: B= 0; critical value of t = 2.015; test statistic; f = 6.9539; do not reject null 
hypothesis and B is positive. 

—18.0831 to 8.9973 

H,: B = 0; H,: B < 0; critical value of t = 1.942; test statistic; t = -5.7911; reject the null 
hypothesis and B is positive. 

86.8864 to 183.9750 

196.601 to 479.3612 

9.6917 to 14.2733 

177.8678 to 190.4931 

65.9838 to 81.4991 

62.1632 to 94.8368 

124.3482 to 139.6277 

66.9186 to 70.4082 

9.4268 to 13.9288 

4.0352 to 54.5739 

b= 0.8671 and m= 1.3471 

b = 0.5686 and m = 1.5903 

b= 0.3990 and m = 0.9057 

b =2.9256 and m= 1.3777 

b= 18.9926 and m = 0.4983 

a= 1.2675, b = 1.2475 and c=—0.0727; S_, = 0.6520; r= 0.9653 


a = ~291.9285, b = 417.3647 and c = -146.3105; S, = 8.2232; 5, = 0.0406; r = 0.9975; 
S,, = 0.1163 

a= 24.1204, b = 1.9639, c= 0.0323, S,, = 1.5211, S, = 288, S. = 4.6277, P = 0.9839, r= 0.9919 
a= 2.5095, b = 1.2, c = 0.7333, S,, = 0.2024, S, = 332.0571, S = 0.1638, 7 = 0.9995, r= 0.9998 


a= 18.9414, b=—0.0313, c=0.0014, Wee 8 218s S = 67.5086, 7? = 0.6812, r=0.8253 
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q11 21.043 to 249.8185 


Laos 32.545 to 643.4171 

713 7.1513 to 16.8138 

7.74 164.4228 to 203.9381 

I> 53.2073 to 94.2757 

7.76 33.0006 to 123.9994 

ald 108.4735 to 155.5024 

7.78 64.6135 to 74.7133 

7.79 6.9051 to 16.4505 

7.80 —33.1963 to 91.8055 

7.81 a= 14.4217, b= 8.9904, c =-5.6087, S,= 1051.3356, S = 4.3466, S = 0.7244, °° = 0.9959, 
r=0.9979 

7.82 a= 71.3427, b= 45932, c=~2.7062, S = 728.5556, S,=50.742,S = 8.457, 1° =0.9304, r= 0.9645 

7.83 a= 2.2561, 6 =3.8171, c = 5.3333, S = 577.84, S = 4.2007, S. = 0.6001, 7 = 0.9927, r= 0.9964 

7.84 a= 23,9103, b= 4.691, c=-9.591, S = 850.36, S = 1.9791, 5 = 0.2827, r =0:9977, r= 0.9988 

7.85 a= 29.4487, b =—5.7449, c= 19.8449, S = 3463.61, S = 2.0574, D5 = 0.2959 — Good: 
r= 0.9997 : 

7.86 a= 107.6961 and b =-0.0143 

7.87 a= 2.7077 and b = 0.0369 

7.88 a= 20.0771 and b = 0.0325 

7.89 a= 63.5770 and b = 0.3548 

7.90 a= 1.9077 and b = 1.5146 

791 a= 4.0672 and b = 0.7823 

7.92 a= 2.3946 and b = 177.5202 

7.93 a= 13.2275 and b = 37.5036 

7.94 a= 1.8515 and b = 284.4318 

95 a=—0.0359 and b = 7.6495; r= 0.9998 

7.96 a=25.0550; b = 3.2229; Sum of squares of errors = 269.5377 

97 a= 3.3385; b = 2.9169; Sum of squares of errors = 7.8421 

7.98 a=-88.6813; b = 7.4678; Sum of squares of errors = 205.2290 

7.99 a= 1.2045; b= 0.2599; Sum of squares of errors = 1.6846 

7.100 a=—40.8141; b = 11.1799; Sum of squares of errors = 308.9377 

7.101 a= 21.5949; b = 3.5681; Sum of squares of errors = 271.3282 

7.102 a= 22.4556; b = 3.13; Sum of squares of errors = 86.0916 

7.103 a= 5.2116; b= 2.8682; Sum of squares of errors = 5.6054 

7.104 a=—88.3833; b = 7.5646; Sum of squares of errors = 17572.5447 

7.105 a=—1.2576; b = 0.3166; Sum of squares of errors = 1.1108 

7.106 a=—49.1549; b= 11.7656; Sum of squares of errors = 41.9142 


FLNGME a= 18.6429; b = 3.4286; Sum of squares of errors = 14.1888 
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7.108 y= 1.01299 + 0.85113x + 0.839187 
I 5 
7.112 $,(x) = 1, 9,%) = x- 0.5, 6,0¢) =x’ -x+ a and o,(x) = 2° -— 1.527 + 0.6x - 0.05. 
2 ae 
TAI 0,(%) = 1, 6,0) =x—- 1, 6,0) =x" - 2x + 3° and ,(x) =2° — 3x° + 5°75 
: 11 ; 
7.114 ,) = 1, 6,0) =x-2,0,@) =x - 4x+ an and 0,(x) = x —6x° + 11.4x-6.8 


8.1 Critical value: x? = 11.345; test statistic: y° = 17.6; reject H,. 


8.2 Critical value: x? = 5.991; test statistic: x” = 1.062; do not reject H,. 
8.3 Critical value: x? = 13.277; test statistic: y°* = 16.3; reject H,. 
8.4 Critical value: x? = 7.378; test statistic: x” = 9.158; reject H,. 


8.5 Critical value: x? = 11.071; test statistic: y* = 2.48; do not reject H,. 
8.6 Expected frequencies: 105, 59.5, 10.5, 75, 42.5 and 7.5. 
8.7 Critical value: x? = 9.21; test statistic: x” = 5.2; do not reject H,. 


8.8 Critical value: x7 = 5.991; test statistic: ¥* = 199.955; reject H,. 

8.9 Critical value: x = 9.21; test statistic: y° = 63.0848; reject H,. 

8.10 Critical value: x” = 7.378; test statistic: x7 = 7.7749; reject H.,. 

8.11 Critical value: %° = 7.378; test statistic: x* = 5.239; do not reject H,. 
8.12 Critical value: x’ = 4.605; test statistic: y7 = 14.5939; reject H,, 

8.13 Critical value: %* = 16.812; test statistic: x7 = 10.1218; do not reject H,. 
8.14 Critical value: y’ = 6.635; test statistic: x7 = 15.1294; reject H,. 

8.15 Critical value: x” = 7.378; test statistic: 77 = 1.9911; do not reject H,. 
8.16 0.01829 to 0.05805; 0.13524 to 0.2409. 

8.17 Test statistic: ¥* = 22.2; do not reject H,. 

8.18 Test statistic: x* = 26.565; reject H,. 

8.19 Test statistic: x” = 64.96; reject H,. 

8.20 Test statistic: x* = 25.6; do not reject H,. 

8.21 0.003364 to 0.011972; 0.058 to 0.10942 

8.22 2528.589 to 11652.842; 50.285 to 107.948 

8.23 Critical value: x’ = 42.98; test statistic: x? = 48; reject H., 

8.24 Critical value: x? = (32.852, 8.907); test statistic: x? = 21.913; do not reject Fis 
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CHAPTER-9 


CHARTER). rash So ee ae 


D1 
92 


9.3 
94 
95 
9.6 
27 
9.8 
39 
9.10 


F= 1.2742 
Test statistic F = 1.2742 and F ica) = 6-93. We fail to reject the null hypothesis. The means of three 
populations are equal. 
Test statistic F = 10.72 > the critical value of F = 3.16. We reject the null hypothesis. 
Critical of F = 4.75 and test statistic F = 2.4291. Do not reject H,. 
Critical of F = 3.47 and test statistic F = 20.6516. Reject the null hypothesis. 
Critical of F = 6.93 and test statistic F = 1.5164. Do not reject H, 
Critical of F = 3.59 and test statistic F = 5.8068. Reject H, 
Critical of F = 3.34 and test statistic F = 7.0634. Reject H, 
Critical of F = 3.24 and test statistic F = 9.3653. Reject H, 
Critical of F = 3.49 and test statistic F = 6.6758. Reject lek 


Numerical Descriptive 
Measures 


Probability 


Random Variables and 
Probability Distributions 


Sampling Distributions 
Estimation 
Hypothesis Testing 


Curve Fitting, Regression 
and Correlation 


Chi-Square Tests 
Analysis of Variance 
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